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AMPLIFICATION 

FIELD OF THE INVENTION 

The present invention relates generally to the field of recombinant DNA technology. 

5 

In particular, the present invention relates to improved methods for producing 
amplified heterogeneous populations of RNA from limited quantities of starting 
RNA. 

10 BACKGROUND TO THE INVENTION 

The field of gene expression profiling has exploded in the last few years, and can 
now be performed on a genome wide scale. Identification of differentially expressed 
genes is being used for medical, clinical, and biological research to help understand 
the molecular mechanisms that underlie biological processes including disease - 

15 such as tumourigenesis - differentiation and development. Gene expression profiling 
can be used across a broad range of applications, for example for the identification 
of novel targets for therapeutic intervention, identification of potential diagnostic 
and prognostic markers, to help understand clinical response to drugs and outcome, 
and identify toxicological responses. DNA arrays of immobilised gene-specific 

20 sequences (probes) on a variety of platforms - such as macroarrays, microarrays or 
high-density oligonucleotide arrays - on different solid supports (e.g., nylon 
membranes, glass slides or silicon/ceramic chips) are widely applicable in different 
areas of genomics research. 

25 The most commonly used mechanism for RNA amplification is a T7 based linear 
amplification method first developed by Van Gelder et al (1990), Eberwine et al 
(1992) and Philips and Eberwine (1996), and is described in US 6,291,170, US 
5,891,636, US 5,716,785, and US 5,545,522. In this procedure, a synthetic poly(dT) 
primer containing the phage T7 RNA polymerase promoter is used to prime 

30 synthesis of first strand cDNA by reverse transcription. Second strand cDNA is 
synthesised by limited RNase H digestion, followed by second strand synthesis with 
E. coli DNA polymerase I. Amplified antisense RNA (aRNA) is obtained from in 
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vitro transcription of the double-stranded cDNA template using T7 RNA 
polymerase. 

This procedure is the basis for Asymetrix GeneChip oligonucleotide arrays where 
5 biotinylated ribonucleotides are incorporated into the aRNA, also known as 
complementary RNA or (see Affymetrix GeneChip ® Expression Analysis 
Technical Manual, and Mahadevappa and Warrington (1999)), and is also the basis 
for glass microarrays (Pabon et al 9 (2001)). 

10 Although this method involves an amplification step with T7 RNA polymerase, it is 
typically used with 5-25|ig total RNA. However, in order to obtain sufficient 
amounts of amplified RNA (aRNA) for microarray experiments, investigators have 
resorted to two or more rounds of cDNA synthesis coupled with T7 amplification to 
generate a representative mRNA profile and this has allowed them to combine 

15 microdissection with array technology (Luo et al (1999), Ohyama et al (2000), 
Affymetrix Technical Note GeneChip ® Eukaryotic Small sample Target Labelling 
Assay Version II, and Luzzi et al (2001)). 

Wang et al (2000) have described a T7-based amplification protocol modified with a 
20 SMART™ template-switching primer that is used to theoretically generate a fiill- 

length double stranded cDNA. The fidelity of aRNA amplified was shown to be 

comparable between the expression profiles of 1:10000 to 1:100000 of commonly 
used input RNA, and those observed with conventional polyA+ RNA or total RNA. 
Hu and co-workers (2002) compared amplified and unamplified samples to evaluate 
25 a similar T7 based protocol with a template switching mechanism adopted from 
Wang et ah Their results showed concordance between amplified and non-amplified 
samples, and four expressed and two differentially expressed genes were verified 
using Northern and Western blotting and immxinohistochemical assay. 

30 Modifications of the T7-based amplification technology have also been used in two 
commercially available amplification kits. The RiboAmp™ RNA Amplification 
Kits (Arcturus) achieves high yields of amplified RNA with a proprietary linear 
amplification method. This method utilises one or two rounds of T7 based 
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amplification depending on the amounts of starting material. RealArt mRNA 
amplification kit (Artus GmbH) provides a T7 based amplification technology. 
mRNA is converted to cDNA with an anchored poly(dT) primer, and then second 
strand synthesis is performed using a proprietary "Box/randomized primer mix" 
5 which generates a random representation of the complete cDNA. The randomly 
primed cDNA is denatured and primed with a T7 promoter/poly(dT) primer. This 
leads to double stranded cDNA with a T7 promoter at one end that can be used as a 
template for in vitro transcription. 

10 Polymerase chain reaction (PCR) is an extremely powerful technique for amplifying 
specific nucleic acid sequences as described in US 4,683,202 and US 4,683,195. 
PCR typically comprises treating separate complementary strands of a target DNA 
with two oligonucleotide primers to form complementary primer extension products 
on both strands that can act as templates for synthesising copies of the desired 

15 nucleic acid sequences. The separation and synthesis steps are repeated in an 
automated system using thermostable polymerases, essentially exponential 
duplication of the target sequences can be achieved. 

Amplification of cDNA by PCR requires the presence of primer binding sites at both 
20 cDNA ends. These primer sites can be attached by a number of methods, including 
a) the addition of a homopolymer tail on the 3' end of the first-strand cDNA 
(Akowitz & Manuelidis, (1989), Belyavsky et al, (1989), Domec, et al (1990), and 
Brady, et al (1990)); b) ligation of single-stranded anchor oligonucleotide to the 3' 
end of the first-strand cDNA (Apte & Siebert, (1993)); c) ligation of single-stranded 
25 anchor oligonucleotide sequence to the 5' end of mRNA (Fromont-Racine et al 
(1993), Kato, et al (1994), Maruyama & Sugano, S. (1994)); d) ligation of double- 
stranded adaptors to the 5 9 end of the double-stranded cDNA (Frohman & Dush, 
(1988)); and e) addition of amplifier sequence at the 5' and 3' ends of single- 
stranded cDNA in the first strand synthesis reaction by a template switching 
30 mechanism (SMART™, Switching Mechanism At the 5' end of RNA Transcripts, 
BD Biosciences Clontech) as described in Chenchik et al (1998) and Zhu, et al 
(2001)). Template switching is also described in US 5,962,271 and US 5,962,272. 
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Amplification techniques based on reverse transcription polymerase chain reaction 
(RT-PCR) have been described for global amplification of mRNAs from single cells 
(Rappolee, et al (1989), Brady, et al (1990), Cheng, et al (1996), O'Brien, et al 
(1994)). PCR based approaches such as that described by Brady et al (1990) have 
5 also been employed to look at gene expression by global mRNA amplification 
followed by cDNA array analysis (Theilgaard-Monch, et al (2001)). However, this 
method is designed to limit the size of the first strand cDNA to about 300-700 bases, 
and so corresponding arrays have to be designed such that the arrayed clones contain 
the 3 'ends of each cDNA. 

10 

Recently, a rapid and highly optimized global RT-PCR procedure has been 
described (Iscove, et al (2002)) that is also based on methods described previously 
by Brady et al (1990). The procedure involves reverse transcription of a first cDNA 
strand primed by poly(dT), addition of an poly(dA) tail with terminal transferase, 

1 5 and subsequent exponential amplification using a single poly(dT)-containing primer. 
Reverse transcription is limited to only a few hundred bases of extreme 3 ! sequence 
by limiting deoxynucleotide concentrations and the time of the reaction. These 
conditions were intended to provide a more uniform likelihood of sampling 
individual mRNA transcripts and more uniform amplification efficiency across all 

20 cycles. This global RT-PCR approach preserved abundance relationships through 
amplification and yielded reproducible results from the picogram range of total RNA 
obtainable from single cells. 

A PCR-based amplification method, three prime end amplification PCR (TPEA- 
25 PCR), has been developed which results in global amplification of the 3' end of all 
the mRNAs present in a sample (Dixon, et al (1998)). PCR amplification occurs 
between primers incorporated into the first strand cDNA during reverse 
transcription, and a second strand primer that has a partially degenerate 3' end and is 
designed to anneal once every kilobase. 

30 

Recently, another PCR-based amplification method balanced PCR" was described 
by Makrigiorgos, et al (2002). Balanced PCR allows a balanced amplification of 
fragments from two complex DNAs even after three sequential rounds of PCR. Two 
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distinct genomic DNA or cDNA samples are tagged with oligonucleotides 
containing both a common and a unique DNA sequence. The samples are pooled and 
amplified in a single PCR tube using the common DNA tag, and therefore there 
should be no differences in amplification efficiency. The PCR-amplified pooled 

5 samples can be separated using the DNA tag unique to each individual genomic 
DNA sample. The principle of this method has been validated with synthetic DNA, 
genomic DNA, and cDNA applied on microarrays. By removing the bias of PCR, 
this balanced PCR approach should allow genetic analysis in minute laser- 
microdissected tissues, paraffin-embedded archived material, or single cells. 

10 However, routine use of this method for amplification of RNAs from biopsies has 
not yet been reported. 

A PCR amplification method based on reverse transcription, followed by random 
PCR amplification of the cDNA and in vitro transcription of the resulting PCR 

15 product with T7 RNA polymerase has been described by Roche Applied Science in 
their Microarray Target Amplification kit (Cat No 3 310 191). In this procedure, 
nanogram quantities of total RNA are reverse transcribed into cDNA using a 
modified poly(dT) primer (TAS-T7 poly(dT)24 ). The unique Target Amplification 
Sequence (TAS) has no homology to any known human sequence, generates the 3* 

20 anchor on the cDNA for subsequent PCR amplification. The T7 promoter sequence 
is added to allow the generation of labelled cRNA targets by in vitro transcription. A 
TAS-(dN)10 primer is used in order to include a 5 ' amplifier sequence on the cDNA 
and is used for the initiation of the second strand cDNA synthesis. PCR is then 
performed with TAS-primer and the optimal numbers of cycles have to be 

25 determined. PCR products are then purified and used for in vitro transcription. 

SMART™ generated cDNA has been used for several applications including cDNA 
library construction, and as hybridisation probes for cDNA and oligonucleotide- 
based microarrays. SMART™ technology coupled with PCR has allowed the use of 
30 lower amounts of starting total RNA, although there is only limited data to support 
the usefulness of this technology in microarray analyses where it is important to 
maintain the mRNA representation [Spirin, et al (1999), Gonzales et al (1999), 
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Vernon, et al (2000), Livesey, et al (2000), Zhumabayeva, et al, (2001) and Fink, et 
al (2002)]. 

The limitation of these technologies is the requirement for relatively large amounts 
5 of intact total RNA. By way of example, typical microarray labelling procedures 
require 0.5-4 jig poly(A) + RNA or 5-50 pg total RNA per microarray. This 
amount of poly(A) + RNA or total RNA can be obtained from samples of tissue that 
weigh greater than 50-100 mg, however many samples are significantly less than 
this, for example many clinical biopsies. A recent pilot study by Assersohn et al 
10 (2002) showed that only 15% of fine needle aspirates (FNA) from human breast 
cancers produced sufficient mRNA for expression array analysis. 

A simple and robust method that is not restricted by low efficiency and the use of 
multiple time consuming steps and allows amplification from small amounts of 
15 RNA, whilst maintaining sensitivity, reproducibility, and can effectively identify 
differentially expressed genes is highly desirable in the art. 

SUMMARY ASPECTS OF THE PRESENT INVENTION 

The present invention combines the use of template switching with nucleic acid 
20 amplification - such as PCR - to generate amplification products representative of an 
entire RNA population. An RNA polymerase promoter sequence allows 
transcription-based amplification to be performed on the derived amplification 
products such that antisense amplified RNA (aRNA) or complementary RNA 
(cRNA) is produced for subsequent downstream applications. 

25 

Advantageously, the RNA generated in accordance with the present invention is 
antisense and therefore has utility for use on cDNA arrays and oligonucleotide 
arrays (spotted oligos or solid phase synthesised oligos such as Affymetrix). 
Commercial or "home-made", arrays are either cDNA or oligo based (typically sense 
30 oligos are arrayed) and therefore production of antisense cRNA as we describe 
herein has utility in either setting. If sense RNA was generated then its use would be 
limited to cDNA arrays and not the majority of most current oligonucleotide arrays. 
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The present invention also overcomes the obstacle of limited tissue samples. 

In a first aspect, the present invention relates to a cDNA-RNA hybrid comprising a 
first strand cDNA synthesis substantially hybridised to RNA wherein the cDNA 
5 comprises an amplifier sequence and an RNA annealing region operably linked to an 
RNA polymerase promoter, and wherein at least one non-templated nucleotide at the 
3* end of the first strand cDNA is hybridised to a template switching 
oligonucleotide. 

10 In a second aspect, the present invention relates to a method for amplifying RNA in 
a sample comprising the steps of: (a) providing a cDNA synthesis oligonucleotide 
comprising an amplifier sequence and an RNA annealing region operably linked to 
an RNA polymerase promoter; (b) annealing the RNA annealing region of the 
cDNA synthesis oligonucleotide to RNA under suitable conditions to produce a 

15 cDNA-RNA complex; (c) incubating said cDNA-RNA complex under conditions 
which permit template-dependent extension of the cDNA synthesis oligonucleotide 
to generate a cDNA-RNA hybrid; (d) contacting said cDNA-mRNA hybrid with a 
template switching oligonucleotide under conditions which permit template 
dependent extension of said cDNA, such that the 3' end of the cDNA comprises a 

20 sequence complementary to said template switching oligonucleotide; (e) providing 
an amplification primer under conditions to generate double stranded amplification 
products corresponding substantially to the first strand cDNA synthesis, such that 
the cDNA amplification products comprise a double stranded RNA polymerase 
promoter; and (f) incubating said cDNA amplification products comprising said 

25 double stranded RNA polymerase promoter under conditions that permit in vitro 
transcription to generate amplified RNA. 

In a third aspect, the present invention relates to a method for preparing an 
expression library of a cell or a cell population comprising the steps of: (a) providing 
30 a cDNA synthesis oligonucleotide comprising an amplifier sequence and an RNA 
annealing region operably linked to an RNA polymerase promoter; (b) contacting 
said cDNA synthesis oligonucleotide with a population of mRNAs from said cell or 
cell population under conditions to allow hybridisation of said cDNA synthesis 
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oligonucleotide to mRNA to produce a cDNA-mRNA complex; (c) incubating said 
cDNA-mRNA complex under . conditions which permit template-dependent 
extension of said cDNA synthesis oligonucleotide to generate a cDNA-mRNA 
hybrid; (d) contacting said cDNA-mRNA hybrid with a template switching 
5 oligonucleotide under conditions which permit template dependent extension of said 
cDNA, such that the 3' end of the cDNA of the cDNA-mRNA hybrid comprises a 
sequence complementary to said template switching oligonucleotide; (e) contacting 
an amplification primer with said cDNA-mRNA hybrid under conditions that 
generate double stranded amplification products corresponding to the first strand 
10 cDNA synthesis, such that the double stranded cDNA amplification products 
comprise a double stranded RNA polymerase promoter; . and (f) incubating said 
double stranded cDNA amplification products comprising said double stranded 
RNA polymerase promoter under conditions that permit in vitro transcription to 
generate amplified RNA. 

15 

In a fourth aspect, the present invention relates to a method of preparing a cDNA 
library from a collection of mRNA molecules comprising the steps of: (a) providing 
a cDNA synthesis oligonucleotide comprising an amplifier sequence and an RNA 
annealing region operably linked to an RNA polymerase promoter; (b) contacting 

20 said cDNA synthesis oligonucleotide with the collection of mRNAs under 
conditions to allow annealing of said cDNA synthesi s oligonucleotide to mRNA 
produce a cDNA-mRNA complex; (c) incubating said cDNA-mRNA complex under 
conditions which permit template-dependent extension of said cDNA synthesis 
oligonucleotide to generate a cDNA-mRNA hybrid; (d) contacting said cDNA- 

25 mRNA hybrid with a template switching oligonucleotide under conditions which 
permit template dependent extension of said cDNA of said hybrid, such that the 3' 
end of the cDNA of the cDNA-mRNA hybrid comprises a sequence complementary 
to said template switching oligonucleotide; (e) contacting a PCR primer with said 
cDNA-mRNA hybrid under conditions that generate double stranded amplification 

30 products corresponding to the first strand cDNA synthesis, such that the double 
stranded cDNA amplification products comprise a double stranded RNA polymerase 
promoter; (f) incubating said double stranded cDNA amplification products 
comprising said double stranded RNA polymerase promoter under conditions that 
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permit in vitro transcription to generate amplified RNA; and (g) preparing a cDNA 
library from the amplified RNA. 

In a fifth aspect, the present invention relates to a method for performing subtractive 

5 hybridisation comprising the steps of: (a) providing a' cDNA synthesis 
oligonucleotide comprising an amplifier sequence and an RNA annealing region 
operably linked to an RNA polymerase promoter; (b) contacting the cDNA synthesis 
oligonucleotide with a collection of mRNAs under conditions to allow annealing of 
said cDNA synthesis oligonucleotide to mRNA in said RNA sample to produce a 

10 cDNA-mRNA complex; (c) incubating said cDNA-mRNA hybrid with enzyme, 
dNTPs and buffer under conditions which permit template-dependent extension of 
said cDNA synthesis oligonucleotide to generate a cDNA-mRNA hybrid; (d) 
contacting said cDNA-mRNA hybrid with a template switching oligonucleotide 
under conditions which permit template dependent extension of said cDNA of said 

15 hybrid, such that the 3' end of the cDNA of the cDNA-mRNA hybrid comprises a 
sequence complementary to said template switching oligonucleotide; .(e) contacting 
an amplification primer with said cDNA-mRNA hybrid under conditions to generate 
double stranded amplification products corresponding to the first stand cDNA 
synthesis, such that the double stranded cDNA amplification products comprise a 

20 double stranded RNA polymerase promoter; (f) incubating said double stranded 
cDNA amplification products comprising said double stranded RNA polymerase 
promoter under conditions that permit in vitro transcription to generate amplified 
RNA; (g) contacting said amplified RNA with a single stranded nucleic acid 
population in the opposite sense to said amplified RNA; (h) providing for the 

25 hybridisation of the sequences present in the amplified RNA and the single stranded 
nucleic acid population; and (i) isolating the nucleic acid population that remains 
single stranded. 

In a sixth aspect, the present invention relates to a method for detecting the 
30 expression of a gene of interest comprising the . steps of: (a) providing a cDNA 
synthesis oligonucleotide comprising an amplifier sequence and an RNA annealing 
region operably linked to an RNA polymerase promoter, wherein the RNA 
annealing region comprises a sequence that is substantially homologous to the 
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mRNA expressed by the gene of interest; (b) contacting said cDNA synthesis 
oligonucleotide with a population of mRNAs in a cell or cell population under 
conditions to allow annealing of said cDNA synthesis oligonucleotide to mRNA to 
produce a cDNA-mRNA complex; (c) incubating said cDNA-mRNA hybrid under 

5 conditions which permit template-dependent extension of said cDNA synthesis 
oligonucleotide to generate a cDNA-mRNA hybrid; (d) contacting said cDNA- 
mRNA hybrid with a template switching oligonucleotide under conditions which 
permit template dependent extension of said cDNA of said hybrid, such that the 3' 
end of the cDNA of the cDNA-mRNA hybrid comprises a sequence complementary 

10 to said template switching oligonucleotide; (e) contacting an amplification primer 
with said cDNA-mRNA hybrid under conditions to generate double stranded 
amplification products corresponding to the first stand cDNA synthesis, such that 
the double stranded cDNA amplification products comprise a double stranded RNA 
polymerase promoter; (f) incubating said double stranded cDNA amplification 

15 products comprising said double stranded RNA polymerase promoter under 
conditions that permit in vitro transcription to generate amplified RNA; and (g) 
deter minin g the presence or absence of amplified RNA, which, amplified RNA is 
complementary to mRNA corresponding to the gene of interest. 

20 In a seventh aspect, the present invention relates to amplified RNA obtainable by the 
method according to the second aspect of the present invention. 

Advantageously, the amplified RNA could be used in a diagnostic, prognostic or 
predictive test starting with small limiting amounts of biological or clinical samples, 
25 for example but not limited to, biopsies, fine needle aspirates, tissue sections, 
bronchioalveolar lavage, macrodissected or microdissected tissues. 

In an eighth aspect, the present invention relates to an expression library obtainable 
by the method according to the third aspect of the present invention,. 

30 

In an ninth aspect, the present invention relates to a cDNA library obtainable by the 
method according to the fourth aspect of the present invention.. 
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In a tenth aspect, the present invention relates to the use of a cDNA-RNA hybrid 
according to the first aspect of the present invention in the amplification of RNA. 

In an eleventh aspect, the present invention relates to the use of a cDNA-RNA 
5 hybrid according to the first aspect of the present invention in the preparation of a 
cDNA library. 

In an twelfth aspect, the present invention relates to the use of a cDNA-mRNA 
hybrid according to the first aspect of the present invention in subtractive 
10 hybridisation. 

In an thirteenth aspect, the present invention relates to the use of a cDNA-mRNA 
hybrid according to the first aspect of the present invention for measuring gene 
expression. 

15 

In an fourteenth aspect, the present invention relates to a kit for the amplification of 
RNA in a sample comprising: (a) a cDNA synthesis oligonucleotide comprising an 
amplifier sequence and an RNA annealing region operably linked to an RNA 
polymerase promoter; (b) a template switching oligonucleotide that has substantially 
20 the same sequence as the amplifier sequence; and (c) an amplification primer that 
has substantially the same sequence as the template switching oligonucleotide. 

Other aspects of the present invention are presented in the accompanying claims and 
in the following description and discussion. These aspects are presented under 
25 separate section headings. However, it is to be understood that the teachings under 
each section heading are not necessarily limited to that particular section heading. 

PREFERRED EMBODIMENTS 

Preferably, the RNA of the cDNA-RNA hybrid is mRNA. 

30 

Preferably, the RNA polymerase promoter is a bacteriophage promoter. More 
preferably, the bacteriophage promoter is selected from the group consisting of T7, 
T3 and SP6. 
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Preferably, the RNA annealing region comprises poly (dT). More preferably, the 
oligo(T) region is from about 10 to about 30 T residues in length. 

5 Preferably, the 3' end of the RNA annealing region comprises a VN clamp (VN-3'), 
wherein V is A, G or C and N is A, G, C or T. 

Preferably, at least one non-templated nucleotide at the 3' end of the first strand 
cDNA synthesis is deoxycytidine. 

10 

Preferably, at least three non-templated nucleotide at the 3' end of the first strand 
cDNA synthesis are hybridised to a template switching oligonucleotide. 

Preferably, at least three of the non-templated nucleotides at the 3' end of the first 
15 strand cDNA synthesis are deoxycytidine nucleotides. 

Preferably, the template switching oligonucleotide has at least three guanine residues 
at its 3' end. 

20 Preferably, the amplifier sequence, amplification primer and the template switching 
oligonucleotide contain the same sequence. 

Preferably, the 3' end of the first strand cDNA synthesis is extended such that it is 
substantially complementary to the template switching oligonucleotide. 

25 

Preferably, the first strand cDNA synthesis is synthesised by a reverse transcriptase. 

Preferably, the reverse transcriptase lacks RNaseH activity but retains wild-type 
polymerase activity. More preferably, the reverse transcriptase is a Moloney Murine 
30 Leukemia virus (MMLV) reverse transcriptase or a mutant thereof. Most preferably, 
the reverse transcriptase is PowerScript™ Reverse Transcriptase (BD Biosciences 
Clontech). 
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Preferably, the cDNA-RNA hybrid is incubated with a reverse transcriptase that 
adds at least one deoxycytidine residue to the 3' end of the first strand cDNA 
synthesis. 

5 Preferably, the reaction comprises 1 mM dNTPs. 

Preferably, the double stranded amplification products are obtained by PCR. 

Preferably, the cDNA synthesis oligonucleotide and the PCR primer have the same 
10 concentration. 

Preferably, the cDNA synthesis oligonucleotide and the PCR primer have a 
concentration of about 0.5 jiM. 

15 Preferably, PCR amplification is performed using the Advantage® 2 Polymerase 
mix (BD Biosciences Clontech). 

Preferably, the optimum number of cycles to generate the double stranded 
amplification products is determined by a method comprising the steps of: (a) 

20 providing a plurality of samples with a known amount of RNA; (b) performing 
amplification for a defined number of cycles on the plurality of samples; (c) 
purifying the double stranded amplification products; (d) providing for the in vitro 
transcription of the purified amplification products; and (e) determining the number 
of amplification cycles that results in the minimum amount of amplified RNA that is 

25 required. 

Preferably, the RNA sample is a clinical sample selected from the group consisting 
of a biopsy, a microdissected tissue, a fine needle aspirate, a flow-sorted cell, a laser 
captured microdissected cell or a single cell. . 

30 

Preferably, gene expression is measured using a microarray. 
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Preferably, the kit according to the fourteenth aspect of the present invention further 
comprises in a separate container a reverse transcriptase. 

Preferably, the kit further comprises in a. separate container an RNA polymerase 
5 specific to the RNA polymerase promoter of the cDNA synthesis oligonucleotide. 

Preferably, the kit further comprises an amplification buffer and one or more 
amplification enzymes. More preferably, the amplification buffer and the 
amplification enzyme(s) are PCR amplification buffer and PCR amplification 
10 enzyme(s). 

Preferably, the kit further comprises a control nucleic acid. . 
ADVANTAGES 

15 The present invention has a number of advantages. These advantages will be 
apparent in the following description. 

By way of example, the present invention is advantageous since it provides a 
commercially useful method. 

20 

By way of further example, the present invention is advantageous since the method 
of the present invention is technically simpler and faster than alternative 
amplification methods, but with equivalent or improved performance over such 
methods. 

25 

By way of further example, the present invention is advantageous since the present 
invention provides a method for the reproducible and robust amplification of small 
amounts of total RNA (typically 5ng-50ng or less), or the approximate equivalent of 
500-5000 cells, with the possibility of further scope to use even lower amounts. 

30 

By way of further example, the present invention is advantageous since it provides a 
novel method for the amplification of limited amounts of RNA while maintaining 
the relative representation of mRNAs. 




101162 



-15- 



BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows a representation of the amplification method according to the present 
invention. 

5. 

Step 1 

Incubating a sample of poly(A)+RNA or total RNA in the presence of a cDNA 
synthesis oligonucleotide which can anneal to RNA. 

10 This oligonucleotide has at the 5' end an amplification primer - such as a PCR 
primer - followed by a T7 promoter sequence and an RNA annealing region. The 
amplification primer generates the 3* anchor on the cDNA for subsequent 
amplification and the T7 promoter sequence allows the generation of amplified RNA 
targets by in vitro transcription. An enzyme possessing reverse transcriptase activity 

15 is included under suitable conditions to generate an RNA-cDNA hybrid and a 
template switching oligonucleotide is also included which can provide CAP- 
dependent extension of full-length cDNA by reverse transcriptase using the template 
switching oligonucleotide as a template, and thereby adding sequence 
complementary to the template switching oligonucleotide to the 3 -end of full-length 

20 cDNA. 

Step 2 

Incubating anchored cDNA:RNA hybrid (ie the population of full-length cDNAs) 
generated at step 1, with a single amplification primer corresponding substantially or 
25 completely to the sequence of the template switching oligonucleotide. 

Conditions suitable to generate amplification products are used, the amplification 
products corresponding to the population of cDNAs present. Advantageously, the 
amplification conditions are pre-determined to result in sufficient amounts of 
30 amplified product for downstream applications but with the minimal number of 
amplification cycles to minimise any amplification bias. Amplification products are 
then purified by standard procedures. 
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Step 3 

The purified amplification products obtained undergo transcription-based 
amplification using T7 polymerase to generate amplified RNA (aRNA), also known 
as complementary RNA (cRNA), for downstream applications. The T7 promoter 
5 region that is incorporated into the cDNA synthesis oligonucleotide is capable of 
inducing transcription by T7 polymerase. 

Figure 2 shows a comparison of cRNA yields. 

10 Figure 3 illustrates quality control metrics across arrays. RawQ, BG log and Scaling 
factor values are shown for each GeneChip hybridisation. 

Figure 4. Top: 3' to 5* ratios for the beta-actin (x00351) probe sets. Middle: 3' to 5' 
ratios for the GAPDH (m33197) probe sets. Bottom: The number of probe sets 
15 called Present as determined by the Affymetrix MAS 5.0 algorithm. 

Figure 5 shows the detection of 5% middle, and 3' ends of polyA+ spikes. Top: Lys 
transcript spiked in at lpM. Middle: Phe transcript spiked in at 5pM. Bottom: Thr 
transcript spiked in at 20pM. Each graph shows the signals for the 5 5 end , middle, 
20 and 3' end probe sets for the polyA+ spikes. 

Figure 6 represents a clustering dendrogram using Pearson correlation. 

Figure 7 illustrates scatter plots (log scale) for -22,000 genes. 

25 

Figure 8 shows Spotfire scatter plots showing genes changing more than 2 fold (> | 
logio 0.3 |) between samples. 

Figure 9 represents scatter plots comparing log ratios to those of Set A ("9}ig 384 
30 non amp repl versus 9pg 842 non amp repl")- Background noise has been removed 
from plots i.e. where a signal of <100 was reported for a particular gene across all 
chip hybridisations. Pearson correlation values are given for each comparison. 
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Figure 10 shows Cluster histograms. Upper histogram shows all genes in all clusters 
. [Red = genes changing <2 fold, Blue = genes changing >2 fold]. Lower histogram 
shows only the genes changing >2 fold (y axis enlarged). Numbers on the columns 
indicate how many genes are in each of the first 20 clusters. 

5 

Figure 1 1 shows example clusters [Red = genes changing <2 fold, Blue = genes 
changing >2 fold]. 

Figure 12 shows 70 k-means clusters [Red = genes changing <2 fold, Blue = genes 
10 changing >2 fold]. 

Figure 13 shows 70 k-means clusters [Red = genes changing <2 fold]. 

Figure 14 shows 70 k-means clusters [Blue = genes changing >2 fold]. 

15 

DETAILED DESCRIPTION OF THE INVENTION 
OLIGONUCLEOTIDE 

The oligonucleotides according to the present invention may be DNA, RNA, 
chimeric mixtures or derivatives or modified versions thereof that are modified at 
20 the base moiety, sugar moiety or backbone and may include other appending groups 
or labels, so long as they are still capable of functioning in the desired reaction. 

By way of example, the oligonucleotide may be a cDNA synthesis oligonucleotide 
comprising DNA. 

25 

By way of further example, the oligonucleotide may be a template switching 
oligonucleotide comprising DNA. 

In a highly preferred embodiment, the template switching oligonucleotide is the 
30 template switching oligonucleotide described in US 5,962,271 and 5,962,272. 

By way of further example, the oligonucleotide may be an amplification primer — 
such as a PCR primer - comprising DNA. 
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The oligonucleotides according to the present invention may be modified so long as 
they are still capable of functioning in the desired reaction. 

5 The oligonucleotides may be modified at the base moiety, sugar moiety, or 
phosphate backbone, and may include other appending groups or labels. 

The oligonucleotides may comprise at least one modified phosphate backbone - 
such as phosphorothioate, a phosphorodithioate, a phosphoramidothioate, a 
10 phosphoramidate, a phosphordiamidate, a methylphosphonate, an alkyl 
phosphotriester, and a formacetal or an analogue thereof. 

The oligonucleotides may be derived by cleavage of a larger nucleic acid fragment 
using non-specific nucleic acid cleaving chemicals or enzymes or site-specific 
15 restriction endonucleases; or by synthesis by standard methods known in the art, e.g. 
by use of a commercially available automated DNA synthesiser and standard 
phosphoramidite chemistry. 

Once the desired oligonucleotide is synthesised, it is cleaved from a solid support on 
20 which it was synthesised and treated, by methods known in the art, to remove any 
protecting groups present. The oligonucleotide may then be purified by any method 
known in the art, including extraction and gel purification. The concentration and 
purity of the oligonucleotide may be determined by, for example, examining the 
oligonucleotide on an acrylamide gel, by HPLC, or by measuring the optical density 
25 at 260 nm in a spectrophotometer. 

In a preferred embodiment of the present invention, the oligonucleotide is a cDNA 
synthesis oligonucleotide comprising the sequence set forth in SEQ ID No 1 : 

30 5'AAGCAGTGGTATCAACGCAGAGTGGCCAGTGAATTGTAATACGACTCA 
CTATAGGGAGGCGG(T) 30 VN-3' 

wherein V is A, G , or C andN is any base. 
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In another preferred embodiment, the oligonucleotide is a cDNA synthesis 
oligonucleotide comprising the sequence set forth in SEQ ID No 4: 

5 5 ' AAGCAGTGGTATCAACGCAGAGTAATACGACTCACTATAGGGAGA(T) 2 4 
VN-3' 

wherein V is A, G , or C and N is any base. 

10 In yet another preferred embodiment of the present invention, the oligonucleotide is 
a template switching oligonucleotide comprising the sequence set forth in SEQ ID 
No. 2: 

S'-AAGCAGTGGTATCAACGCAGAGTACGCGGG-S 1 

15 

In still another preferred embodiment, the oligonucleotide is an amplification primer 
comprising the sequence set forth in SEQ ID No. 3: 

5 f -AAGCAGTGGTATCAACGCAGAGT-3 , 

20 

The oligonucleotides corresponding to SEQ ID Nos 2 and 3 have been described in 
US 5,962,271 and 5,962,272. 

In the context of the present invention, the cDNA synthesis oligonucleotide 
25 comprises an amplifier sequence and an RNA annealing region operably linked to an 
RNA polymerase promoter. 

AMPLIFIER SEQUENCE 

The amplifier sequence in the context of the present invention relates to a portion of 
30 the cDNA synthesis oligonucleotide that contains the same or substantially the same 
sequence as the template switching oligonucleotide and the amplification primer, as 
described herein. 
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The amplification primer is able to hybridise to the complementary sequence of the 
amplifier sequence, such that the second strand cDNA synthesis may be amplified 
by, for example, PCR. 

5 Preferably, the amplifier sequence is located at the 5' end of the cDNA synthesis 
oligonucleotide and generally, will not be translated into RNA. 

A person skilled in the art will appreciate that any sequence may be used for the 
amplifier sequence as long as the amplification primer is able to hybridise or 
10 substantially hybridise to the complementary sequence of the amplifier sequence. 

The substantially identical template switching oligonucleotide and the amplifier 
sequence at both ends of the cDNA serve as universal priming sites for end-to-end 
amplification of the cDNA population. 

15 

In a preferred embodiment, the amplifier sequence, the amplification primer and the 
template switching oligonucleotide contain the substantially the same sequence - 
such as regions with the same sequence. 

20 In another preferred embodiment, the amplifier sequence comprises the sequence set 
forth in SEQ ID No. 3. 



RNA ANNEALING REGION 

As used herein, the term "RNA annealing region" refers to a portion of the cDNA 
25 synthesis oligonucleotide that is able to anneal to RNA. 

For many applications, it is desirable to preferentially enrich for one type of RNA 
with respect to other cellular RNAs, such as messenger RNA (mRNA), transfer 
RNA (tRNA) and ribosomal RNA (rRNA). Advantageously, most mRNAs contain 
30 a poly(A) tail at their 3 ! end which allows them to be enriched by affinity 
chromatography, for example, using oligo(dT) or poly(U) coupled to a solid support 
- such as cellulose or Sephadex (Ausubel et aL, eds., 1994, Current Protocols in 
Molecular Biology, vol. 2, Current Protocols Publishing, New York). 
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Accordingly, in a preferred embodiment of the present invention, the RNA 
annealing regions comprises poly(dT). Preferably, the poly(dT) is a 
polythymidylate region comprising about 10 to 30, preferably about 15 to 25, most 
5 preferably about 20 T residues, which bind with the poly(A) tail present on the 3' 
terminus of each mRNA. 

If more sequence information is available for a given RNA, then the RNA annealing, 
region may be designed more specifically to hybridise with a more specific 
10 population of RNA. Moreover, the RNA annealing region may comprise a 
collection of RNA annealing regions. 

Also, if there is ambiguity in the sequence information, a number of RNA annealing 
regions may be present. Thus, by way of example, when several possible nucleic 
15 acid sequences that encode a protein could be correct based on the protein sequence, 
a collection of RNA annealing regions containing sequences representing most or all 
of the possible codon variations may be prepared. 

If the sequence information for the desired RNA is known, the RNA annealing 
20 region need not reflect the exact sequence of the RNA, and can be "degenerate". 
Non-complementary bases or longer sequences can be interspersed into the RNA 
annealing region, provided that the RNA annealing region has sufficient 
complementarity with the sequence of the strand to . be amplified to permit 
hybridisation. 

25 

Typically, the RNA annealing region is located at the 3* end of the cDNA synthesis 
oligonucleotide and is operably linked to the RNA polymerase promoter. 

The term "operably linked" refers to a juxtaposition wherein the RNA annealing 
30 region and the RNA polymerase promoter are in a relationship permitting them to 
function in their intended manner. Thus, in the context of the present invention, the 
RNA annealing region and the RNA polymerase promoter are in a relationship that 
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permits the RNA annealing region to be expressed from the RNA polymerase 
promoter. 

Advantageously, the 3' end of the RNA annealing region may comprise one or more 
5 nucleotides that assist in the priming of mRNA. Preferably, the nucleotides 
comprise a VN clamp, wherein V is A, G or C and N is A, G, C or T. 

RNA POLYMERASE PROMOTER 

Promoter sequences are regions where RNA polymerase binds tightly to DNA and 
10 contains the start site and signal for RNA synthesis to begin. 

The RNA polymerase promoter will usually comprise between about 15 and 250 
nucleotides, preferably between about 15 and 60 nucleotides, most preferably 
between about 15 and 40 nucleotides, from a naturally occurring RNA polymerase 
15 promoter or a consensus promoter region (Alberts et al., in Molecular Biology of the 
Cell, 2d Ed., Garland, N.Y. (1989)). Native strong promoters typically contain two 
highly conserved DNA sequences, each about six nucleotides long, which are 
located upstream from the start site and separated from each other by about 17 
nucleotides of unrecognised DNA. 

20 

The RNA pol^qnerase that is used for transcription must be capable of binding to the 
particular RNA polymerase promoter region that is present in the eDNA synthesis 
oligonucleotide according to the present invention. In practice, any combination of 
RNA polymerase and RNA polymerase promoter may be used as long as the 
25 polymerase has sufficient specificity for that promoter to initiate in vitro 
transcription. 

The promoter may be a prokaryotic or a eukaryotic promoter. Preferably, the 
promoter is a prokaryotic promoter. More preferably, the prokaryotic promoter, is a 
30 phage or virus promoter. Most preferably, the RNA polymerase promoter is a 
promoter derived from a bacteriophage, for example, T3, T7 or SP6 polymerase 
(Chamberlin and Ryan, in The Enzymes, ed. P. Boyer (Academic Press, New York) 
pp. 87-108 (1982)). 
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A typical sequence of the T3 RNA polymerase promoter is: 
5' GCATTAACCCTCACTAAC 3' (SEQ ID No. 5) 

5 

A number of variant T3 promoter sequences are also known, especially those in 
which the first three bases of the non-template strand (shown above) are 5' TTA 3\ 
rather than AAA. See for example, US 5,037,745. 

10 A typical sequence of the T7 RNA polymerase promoter is : 

5 1 TAATACGACTCACTATA 3' (SEQ ID No. 6) 

A number of variant forms of T7 RNA polymerase are also known in the art. By 
15 way of example only, further variants of the T7 RNA polymerase promoter are: 

5' AATACGACTCACTATAGGGAGA 3' (SEQ ID No. 7) 

and 

20 

5' GGCCAGTGAATTGTAATACGACTCACTATAGGGAGGCGG 3' (SEQ ID 
No. 8) 

A typical sequence of the SP6 RNA polymerase promoter is: 

25 

5' ATTTAGGTGACACTATA 3' (SEQ ID No. 9) 

The RNA polymerase promoter may be a hybrid promoter. 

30 The promoter may additionally include features to ensure or to increase the level of 
expression. 
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The most preferred promoter is a T7 RNA polymerase promoter. The very high 
degree of specificity shown by T7 RNA polymerase for its promoter site 
(Chamberlin et aL, in The Enzymes, ed. P. Boyer (Academic Press, New York) pp. 
87-108 (1982)) has made this enzyme a widely used reagent in a variety of 

5 recombinant DNA techniques. The natural T7 promoters share a highly conserved 
sequence covering about bp -17 to about +6 relative to the start of the RNA chain 
(Dunn and Studier, J. Mol. Biol. 166: 477-535 (1983) and J. Mol. Biol. 175: 1 1 1-1 12 
(1984)). The lack of efficient termination signals for T7 polymerase also enable it to 
make transcripts from almost any DNA (see, Rosenberg et al., Gene 56: 125-135 

10 (1987)). 

RNA polymerases are widely available from a number of commercial sources - such 
as Promega Corporation, Ambion Inc, Enzo Diagnostics Inc., Epicentre 
Technologies. 

15 

The RNA polymerase promoter may be single stranded or double stranded. 
Advantageously, following the PCR amplification step of the method described 
herein, the promoter becomes double stranded. 



20 In a highly preferred embodiment, the orientation of the RNA polymerase promoter is such 
that antisense aRNA is expressed. Advantageously, antisense aRNA may be used in cDNA 
arrays and oligonucleotide arrays — such . as spotted oligonucleotides or solid phase 
synthesised oligonucleotides eg. Affymetrix. Commercial or "home-made" arrays are either 
cDNA or oligonucleotide based (typically sense oligonucleotides are arrayed) and therefore 

25 production of antisense cRNA as we describe herein has utility in either setting. 



TEMPLATE SWITCHING 

Template switching refers to the process of template-dependent synthesis of the 
30 complementary strand by an enzyme - such as reverse transcriptase - using two 
templates in consecutive order and which are not covalently linked to each other by 
phosphodiester bonds. 



101162 

- 25 - 

The process of template switching is described in Chenchik, et al (1998), Clark, 
(1988) and Hu & Temin, (1990), US 5,962,271 and US 5,962,272. 

Template switching in the context of the present invention is achieved by utilising 
5 two of the intrinsic properties of reverse transcriptase namely the ability to add non- 
templated nucleotides to the 3' end of the first-strand cDNA, and the ability to 
switch templates. 

Preferably, the template switching oligonucleotide comprises more than one residue 
10 — at its 3' end that base pairs with the complementary residues that are added at the 
3' end of the cDNA. More preferably, the template switching oligonucleotide 
comprises at least two residues at its 3' end that base pairs with the complementary 
residues that are added at the 3' end of the cDNA. Most preferably, the template 
switching oligonucleotide comprises at least three, residues at its 3' end that base 
15 pairs with the complementary residues that are added at the 3' end of the cDNA. 

Preferably, at least one of the residues at the 3' end of the template switching 
oligonucleotide comprise deoxyguanindine nucleotides. More preferably, at least 
two of the residues at the 3 s end of the template switching oligonucleotide comprise 
20 deoxyguaoindine nucleotides. More preferably, at least three of the residues at the 
3* end of the template switching oligonucleotide comprise deoxyguanindine 
nucleotides. Most preferably, three of the residues at the 3' end of the template 
switching oligonucleotide comprise deoxyguanindine nucleotides. 

25 Reverse transcriptase replicates to the 5' end of the mRNA then switches templates 
and continues to replicate to the end of the template switching oligonucleotide. The 
resulting first-strand cDNA synthesis contains the complete 5' ends of the mRNA's 
as well as sequences complementary to the template switching oligonucleotide. 

30 The substantially identical template switching oligonucleotide and the amplifier 
sequence at both ends' of the cDNA serve as universal priming sites for end-to-end 
cDNA amplification of the cDNA population. 
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A person skilled in the art will appreciate that any sequence may be used for the 
template switching oligonucleotide as long as it is identical or substantially identical 
to the amplification primer. Accordingly, in a preferred embodiment of the present 
invention the template switching oligonucleotide has substantially the same 
5 sequence as the amplifier sequence. 

The 3' end of the first strand cDNA synthesis is extended such that it is 
complementary or substantially complementary to the template switching 
oligonucleotide. Advantageously, an amplification primer is used that is able to 
10 hybridise to the sequence that is complementary to the template switching 
oligonucleotide. A DNA polymerase is then able to extend from the 3* end of the 
hybridised or substantially hybridised amplification primer, thereby resulting in a 
second strand cDNA synthesis. 

15 cDNA-RNA HYBRID 

In the context of the present invention, a cDNA-RNA hybrid refers to a hybrid that 
is formed between RNA and a first-strand cDNA synthesis in which the cDNA is 
extended such that the first-strand cDNA synthesis is complementary or 
substantially complementary to the RNA. 

20 

Typically, at least one non-templated nucleotide at the 3' end of the first strand 
cDNA is hybridised to a template switching oligonucleotide in the cDNA-RNA 
hybrid. 

25 Preferably, at least one non-templated nucleotide at the 3' end of the first strand 
cDNA synthesis is deoxycytidine. More preferably, at least two non-templated 
nucleotides at the 3' end of the first strand cDNA synthesis are hybridised to a 
template switching oligonucleotide. More preferably, at least three non-templated 
nucleotides at the 3' end of the first strand cDNA synthesis are hybridised to a 

30 template switching oligonucleotide. 

Preferably, at least one of the non-templated nucleotides at the 3' end of the first 
strand cDNA synthesis are deoxycytidine nucleotides. More preferably, at least two 
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of the non-templated nucleotides at the 3' end of the first strand cDNA synthesis are 
deoxycytidine nucleotides. Most preferably, at least three of the non-templated 
nucleotides at the 3* end of the first strand cDNA synthesis are deoxycytidine 
nucleotides. 
5 

Preferably, the hybrid is a cDNA-mRNA hybrid. 

Typically, the first strand cDNA synthesis is catalysed .by a reverse transcriptase 
using the RNA of the cDNA-mRNA as a template. 

10 

cDNA-RNA COMPLEX 

A cDNA-RNA complex refers to a complex that is formed between RNA and a 
cDNA synthesis oligonucleotide in which the RNA annealing region of the cDNA 
synthesis oligonucleotide is hybridised or substantially hybridised to a 
15 complementary or substantially complementary RNA. 

The RNA template-dependent extension of the cDNA synthesis oligonucleotide 
results in extension of the cDNA synthesis oligonucleotide such that the first-strand 
cDNA synthesis is complementary or substantially complementary to RNA, thus 
20 forming a cDNA-mRN A hybrid. 

Preferably, the complex is a cDNA-mRNA complex. 

SAMPLE 

25 A sample in the context of the present invention may be any entity that comprises 
RNA. 

Any RNA, in purified or non-purified form, may be utilised in the method of the 
present invention, provided that it contains or is suspected to contain the RNA that is 
30 of interest. The desired RNA may be a minor or a major fraction of a complex 
mixture. Accordingly, the present invention is useful not only for producing large 
amounts of one specific nucleic acid sequence, but also for amplifying 
simultaneously one or more different specific nucleic acid sequences. 



-28- 



The RNA - such as cloned RNA or total RNA - may be obtained from any 
prokaryotic or eukaryotic source, for example, bacteria, yeast, viruses, organelles, 
and higher organisms such as plants or animals. RNA may be extracted from blood, 
5 tissue material or cells by a variety of techniques such as those described in Maniatis 
et al., supra. 

Accordingly, the sample may be or may be derived from biological material. 

10 The sample may be a clinical sample - such as a biopsy, microdissected tissue or 
laser-captured cells. Preferably, the sample is a small sample, for example, a small 
biopsy, a fine needle aspirate, a macrodissected tissue, a flow-sorted cell, a laser 
captured microdissected cell or a small number of cells - such as a single cell. 

15 Advantageously, the present invention provides a method for the reproducible and 
robust amplification of small amounts of total RNA. Preferably, the present 
invention is able to amplify 5-50 ng of total RNA, more preferably, the present 
invention is able to amplify 5-25ng total RNA, most preferably, the present 
invention is able to amplify 5 ng or less total RNA with the possibility of further 

20 scope to use even lower amounts. 

5-50ng total RNA equates to the approximate equivalent of 500-5000 cells. 

Total cellular RNA, cytoplasmic RNA, or poly(A)+ RNA may be used. Methods for 
25 preparing total and poly(A)+ RNA are well known and are described generally in 
Sambrook et al. (1989, Molecular Cloning- A Laboratory Manual (2nd Ed.), Vols. 
1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.) and Ausubel et al., 
eds. (1994, Current Protocols in Molecular Biology, vol. 2, Current Protocols 
Publishing, New York). 

30 

Preferably, total RNA is prepared by the techniques described in Chirgwin et al, 
(1987), Chomczynski & Sacchi (1987), Sambrook et al, (1989), or Farrell Jr. (1993), 
and a number of high quality commercial kits are also available. More preferably, 
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total RNA used is prepared using the guanidinium thiocyanate method of Chirgwin 
etal,(1987). 

The integrity of total RNA may be checked using various methods that are known in 
5 the art. By way of example, the RNA may be analysed using RNA gel 
electrophoresis (eg formaldehyde/ agarose gel), or Agilent LabChip. For 
mammalian total RNA, two bands at approximately 4.5 and 1.9 kb should be visible; 
these bands represent 28S and 18S ribosomal RNA respectively, and the ratio of 
intensities of these bands should typically be 1 .5-2.5 : 1 . . 

10 

RNA purification kits for microscale RNA preparation are available from a number 
of commercial suppliers (for example Absolutely RNA™ Nanoprep, Stratagene; 
PicoPure™, Arcturus; RNeasy®, Qiagen; RNAqueous™ Microkit, Ambion). 

15 Generally, the RNA samples are immediately snap-frozen in liquid nitrogen and then 
stored at -80°C until RNA extraction. 

Typically, the total RNAs are treated with RNase-free DNasel, which may be 
obtained from various manufacturers - such as Ambion' s DNA-free™ kit. 

20 

cDNA SYNTHESIS 

In accordance with the present invention, the first and second strand cDNA synthesis 
may be performed in separate tubes. Advantageously, the first and the second strand 
synthesis are performed in the same tube which enhances the synthesis procedure, 
25 maximises recovery of cDNA and makes the method simpler and quicker to 
perform. 

The cDNA synthesis oligonucleotide for first strand cDNA synthesis may be 
hybridised to RNA in a suitable buffer at a temperature between about 60°C and 
30 90°C, preferably about 70°C for about 5 minutes, followed by cooling to about 4 °C, 
before the reverse transcriptase is added. 

. Following the hybridisation of the cDNA synthesis oligonucleotide to RNA, a first 
cDNA strand is synthesised. This first strand of cDNA is preferably produced 
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through the process of reverse transcription, wherein DNA is made from RNA, 
utilising reverse transcriptase following methods that are familiar to a person skilled 
in the art. 

5 Advantageously, any reverse transcriptase may be used in the present invention as 
long as the enzyme adds deoxyribonucleotides to the 3 f terminus following extension 
(Varmus, Science 240: 1427-1435 (1988)) and the enzyme lacks RNase H activity. 

Preferably, the reverse transcriptase lacks RNaseH activity but retains wild-type 
10 polymerase activity such that longer cDNA's can be synthesised. More preferably, 
the reverse transcriptase is Moloney Murine . Leukemia virus (MMLV) reverse 
transcriptase or a mutant thereof. Most preferably, the reverse transcriptase is 
PowerScript™ Reverse Transcriptase (BD Biosciences Clontech). 

15 The amount of reverse transcriptase employed may vary as will be appreciated by a 
person skilled in the art. The reverse transcription is performed by incubation for, 
for example, approximately 1 hour with reverse transcriptase at an appropriate 
temperature, which must be in a temperature range in which the reverse transcriptase 
retains enzyme activity. 

20 

The reaction may be performed between 37°C and 5 5 °C, preferably between 37°C 
and42°C. 

Most preferably, the reaction is performed at optimal enzyme activity - such as at 
25 about 42 °C. 

The reverse transcription reaction may be terminated by heating the reaction 
mixture to 95 °C for about 5 minutes to inactivate the enzyme, optionally, followed 
by chilling on ice. 

30 

The first strand cDNA synthesis step may be modified by including dNTP's - such 
as 1 mM dNTP's - in the initial step in the procedure where the RNA is annealed to 
the cDNA synthesis oligonucleotide, by, for example heating to 70°C for 2 min. 



101162 



-31- 



This inclusion of dNTPs has previously been shown to increase the efficiency in 
another application, namely RT-PCR reactions (Huang, et al 9 (2000)). 
Advantageously, this modification may increase the yield of cRNA, from defined 
small amounts of starting_total RNA. Although the mechanism for this observation 
5 is not known, without wishing to be bound by any particular theory, it may be due to 
stabilisation of RNA-primer hybridisation, and/or may help stabilisation of the 
template switching mechanism. 

Advantageously, this modification allows a reduced number of amplification (eg. 
10 PGR) cycles with limited starting material, and may therefore help minimise any 
distortions to the mRNA distribution. 

Typically, a single cycle of reverse transcription is carried out More than one cycle 
of reverse transcription may be performed (with denaturation in between cycles). 

15 

AMPLIFICATION 

"Amplification" refers to a process for multiplying nucleic acid strands in vitro. 

In a highly preferred embodiment, the amplification method of the present invention 
20 is used for multiplying DNA strands - such as cDNA - in vitro. 

An exemplary technique is PCR, which exponentially amplifies nucleic acid 
molecules. 

25 PCR is described in US 4,683,195 and US 4,683,202. PCR consists of repeated 
cycles of DNA polymerase generated primer extension reactions. The target DNA is 
heat denatured and two oligonucleotides, which bracket the target sequence on 
opposite strands of the DNA to be amplified, are hybridised. These oligonucleotides 
become primers for use with DNA polymerase. The DNA is copied by primer 

30 extension to make a second copy of both strands. By repeating the cycle of heat 
denaturation, primer hybridisation and extension, the target DNA can be amplified a 
million fold or more in about two to four hours. PCR is a molecular biology tool 
which , must be used in conjunction with a detection technique to determine the 
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results of amplification. An advantage of PCR is that it increases sensitivity by 
amplifying the amount of target DNA by 1 million to 1 billion fold in approximately 
4 hours. 

5 PCR may be used in the methods of the present invention as follows. A DNA 
polymerase - such as Taq DNA polymerase - is added to the reaction in addition to 
a single PCR primer that comprises substantially the same sequence as the amplifier 
sequence of the cDNA synthesis oligonucleotide. 

10 In a preferred embodiment, the polymerase that is used is Advantage® 2 Polymerase 
Mix (BD Biosciences Clontech), which allows efficient and accurate amplification 
of cDNA templates by long-distance PCR (Barnes, 1994). The Advantage® 2 
Polymerase Mix contains TITANIUM™ Taq DNA Polymerase, a nuclease-deficient 
N-terminal deletion of Taq DNA Polymerase, and a minor amount of a proofreading 

15 polymerase. Advantage® 2 Polymerase Mix also contains TaqStart™ Antibody 
(BD Biosciences Clontech) to provide automatic hot-start PCR (Kellogg et al., 1994) 
and reduce non-specific priming of template. This combination allows efficient 
amplification of full-length cDNAs with a significantly lower error rate than that of 
conventional PCR (Barnes, 1 994). 

20 

The single PCR primer hybridises substantially to the 3* end of the first strand cDNA 
synthesis (after denaturation) which corresponds to the complementary, sequence of 
the template switching oligonucleotide. The DNA polymerase extends from the 3 ! 
end of the PCR primer resulting in a complementary cDNA second strand synthesis. 
25 In subsequent rounds of PCR, the single PCR primer is able to hybridise to the 3 f 
end of the first strand of the cDNA molecule (and all the amplified copies of the first 
strand) and to the 3 f end of the second strand of the cDNA molecule (and all the 
amplified copies of the second strand), resulting in amplification. 

30 The primer is preferably a single stranded oligodeoxynucleotide. The primer must 
. be sufficiently long to act as a template for the synthesis of extension products in the 
presence of the replicating enzyme. The exact lengths of the primers and the 
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quantities used will depend on many factors, including temperature, degree of 
homology and other conditions. 

For example, when amplifying a specific sequence, the oligonucleotide primer 
5 typically contains .between about 10 and 50 nucleotides, preferably 15-25 or more 
nucleotides, although it may contain fewer nucleotides, depending. For other 
applications, the oligonucleotide primer is typically, but not necessarily, shorter, 
e.g., 7-15 nucleotides. Such short primer molecules generally require cooler 
temperatures to form sufficiently stable hybrid complexes. 

10 

In a preferred embodiment of the present invention, the PGR primer and the cDNA 
synthesis oligonucleotide are provided at the same concentration in the reaction. 

In another preferred embodiment, the cDNA synthesis oligonucleotide and the PCR 
1 5 primer have a concentration of about, for example, 0.5 |LiM. 

The oligonucleotide primers may be prepared using any suitable method, such as, 
for example, the well known phosphotriester and phosphodiester methods, or 
automated embodiments thereof. One method for synthesising oligonucleotides on a 
20 modified solid support is described in US 4,458,066. It is also possible to use a 
primer which has been isolated from a biological source (such as a restriction 
endonuclease digest). 

In a preferred embodiment, the PCR primer comprises the sequence set forth in SEQ 
. 25 ID No. 3. 

PCR amplification is performed using methods that are well known in the art. By 
way of example only, the thermal cycling parameters of the PCR reactions may 
comprise 60 sec at 95°C for hot start, followed by 3 step cycling for a pre- 
30 determined numbers of cycles of denaturation for 15 sec at 95°C, annealing for 30 
sec at 65 °C, and extension for 6 min at 68°C. Reactions may then be held at 4°C in 
the thermal cycler until purification. 
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Advantageously, an improved method for determining the optimal number of 
amplification cycles that are required from a given amount of starting material may 
be utilised. For any amplification method - such as PCR - one of the key factors is 
to perform the minimum number of cycles (for a given amount of starting material) 

5 that will result in sufficient target for the intended downstream application. If too 
many cycles are performed, then it is likely that the representation of RNAs - such 
as mRNAs - could be biased as the amplification reactions are unlikely to be 
identical for each RNA template in the complex mixture, and the amplification 
reaction may also reach a plateau. If too few cycles are performed then insufficient 

1 0 amplification product may be obtained for subsequent applications. 

In the metfiod described here, for a given amount of starting total RNA (eg 50ng or 
5ng), the minimum numbers of amplification cycles that are required are determined 
empirically such that when the entire amplification products are then used for in- 

15 vitro transcription (IVT) reactions to generate cRNA, sufficient cRNA for the 
intended downstream application is obtained. This can be done by starting with a 
given amount of total RNA (eg 5ng or 50ng) by setting up identical reactions and 
performing an amplification reaction for defined numbers of cycles (or by removing 
aliquots from an amplification reaction for analysis and performing additional cycles 

20 on the remainder of the amplification reaction). The entire amplification products 
from each reaction (for example 50ng starting total RNA that has undergone PCR 
thermal cycling for 9, 10, 1 1, 12, 13 or 14 cycles) is purified and used for IVT. The 
number of cycles that gives the minimum amount of cRNA that is sufficient for the 
downstream application is then determined. This number of cycles can then be used 

25 routinely for similar studies with other RNA's at the same starting concentrations. 

Advantageously, the amplification step can also be enhanced by addition of dNTP's 
in the initial step in the procedure where the RNA is heated to 70°C for 2 minutes, 

30 In a preferred embodiment, about 1 mM dNTPs are added to the RNA/primer mix 
when the samples are being denatured prior to reverse transcription. 
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This means that less amplification is required with limited amounts of starting 
material and therefore help minimise any distortions to the mRNA distributions. 

The amplification products that are obtained will typically be purified. This may be 
5 achieved using various methods that are known in the art. By way of example, PCR 
products may be purified using QIAGEN Qiaquick columns as per manufacturer's 
instructions. 

SUBSTANTIALLY 

10 The term "substantially" when used in relation to annealing or hybridisation, means 
that an oligonucleotide — such as a primer - should be sufficiently complementary to 
hybridise or anneal to its respective nucleic acid. 

The oligonucleotide sequence need not reflect the exact sequence of its respective 
1 5 nucleic acid, and can, in fact, be "degenerate". Non-complementary bases or other 
sequences may be interspersed into the oligonucleotide or the nucleic acid, provided 
that the oligonucleotide sequence has sufficient complementarity with the sequence 
to permit hybridisation. Thus, by way of example, the primers used for PCR 
amplification may be selected to be "substantially" complementary to the specific 
20 sequence to be amplified. 

HYBRIDISATION 

As used herein, the term "hybridisation" refers to the process by which a strand of 
nucleic acid joins with a complementary strand through base pairing as well as the 
25 process of amplification as carried out in, for example, polymerase chain reaction 
(PCR) technologies. 

The present invention encompasses the use of nucleotide sequences that are capable 
of hybridising to nucleotide sequences. 

30 

TRANSCRIPTION 

The PCR reaction step described above results in a double stranded T7 RNA 
polymerase promoter that is operably linked to a double stranded DNA sequence. 
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By utilising a double stranded T7 promoter, the DNA sequence that is operably 
linked to the promoter may be transcribed into RNA. Various methods for in vitro 
transcription are well known in the art and many commercial kits are readily 
5 available. 

By way of example only, the ENZO ® BioArray™ HighYield™ RNA Transcript 
Labelling Kit, Affymetrix (900182) may be used. The necessary reagents for the in 
vitro transcription are combined with the PCR reactions and in vitro transcription is 
performed at an appropriate time and temperature — such as 37°C for 5hrs. The 
10 incubation time may be varied depending upon how many transcripts it is desired to 
generate. 




Depending on the ultimate use of the RNA, the necessary ribonucleotide 
triphosphates will be included in the transcription reaction mixture. One or more of 
15 the ribonucleotides may be labelled, with for example, a radioactive label, biotin, or 
the like. A wide variety of labelling techniques are well known to those skilled in 
the art and may be used in accordance with standard procedures, as described in US 
4,755,619, for example. 

20 Once the RNA transcripts have been obtained, various well known procedures may 
be employed for their processing. The transcripts may be removed from the reaction 
mixture and purified using various methods known in the art - such as the RNeasy® 
Mini columns (QIAGEN) as per manufacturers instructions. The aRNA may be 
used as template for cDNA synthesis and subjected to PCR to further expand desired 

25 sequences. The aRNA may be used unmodified for further cloning, expression, use 
as probe or driver nucleic acid in subtractive hybridisation and the like. 

AMPLIFIED RNA 

As used herein, the term amplified RNA (aRNA) is used interchangeably with the 
30 term complementary RNA (cRNA). 

aRNA refers to the amplified antisense RNA that is obtained from in vitro 
transcription of the double-stranded cDNA template using an RNA polymerase. 
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LABEL 

In accordance with the present invention, amplified RNA may be labelled during in 
vitro transcription to facilitate its detection/use in subsequent steps. 

5 

The amplified RNA may be directly labelled with any label that is known in the art, 
including, but not limited to, radioactive labels, fluorophores, chemiluminescent 
molecules, or enzymatic markers — such as those that produce a detectable signal 
when a particular chemical reaction is conducted - and the like. 

10 

Alternatively, ribonucleotides may be obtained which are labelled with, for example, 
biotinylated CTP and UTP, where these ribonucleotides will become incorporated in 
the amplified RNA. The biotin may then be used for binding to avidin, which is 
labelled with an appropriate label capable of providing for detection. Other 
15 modified ribonucleotides - such as cyanine 3 and cyanine 5 CTP and UTP, or 
aminoallyl UTP can be readily incorporated into amplified RNA. A wide variety of 
labelling techniques are well known to those skilled in the art. 

Labelling of RNA may be accomplished by including one or more labelled NTPs in 
20 the in vitro transcription reaction mixture. NTPs may be directly labelled with a 
radioisotope, such as 32 P, 35 S, 3 H. NTPs may be directly labelled with a fluorescent 
label - such as fluorescein isothiocyanate, lissamine, Cy3, Cy5, and rhodamine 110. 

RNA may also be indirectly labelled by incorporating a nucleotide linked covalently 
25 to a hapten or to a molecule - such as biotin - to which a labelled avidin molecule 
may be bound, or digoxygenin, to which a labelled anti-digoxygenin antibody may 
be bound. RNA may be labelled with labelling moieties during chemical synthesis or 
the label may be attached after synthesis by methods known in the art. 

30 Often it is desired to compare gene expression in two different populations of cells, 
perhaps derived from different tissues or perhaps exposed to different stimuli. Such 
comparisons are facilitated by labelling the RNAs from one population with a first 
fluorophore and the RNAs from the other population, with a second fluorophore, 
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where the two fluorophores have distinct emission spectra. Again, Cy3 and Cy5 are 
particularly preferred fluorophores for use in comparing gene expression between 
two different populations of cells. 

5 NUCLEOTIDE SEQUENCE 

As used herein, the term "nucleotide sequence' 5 is synonymous with the term 
"polynucleotide". 

Aspects of the present invention involve the use of nucleotide sequences, which may 
10 be available in databases. 

The nucleotide sequence may be DNA or RNA of genomic or synthetic or 
recombinant origin. The nucleotide sequence may be double-stranded or single- 
stranded whether representing the sense or antisense strand or combinations thereof. 

15 

The nucleotide sequence may be prepared by use of recombinant DNA techniques 
{e.g. recombinant DNA). 

The nucleotide sequence may be the same as the naturally occurring form, or may be 
20 derived therefrom. 




VARIANTS/HOMOLOGUES/DERIVATIVES 

In the context of the present invention, reference to nucleic and amino acid 
sequences includes mutants, variants, homologues, derivatives or fragments thereof. 
25 Moreover, reference to a particular polypeptide includes mutants, variants, 
homologues, derivatives or fragments thereof which have the activity of the 
naturally occurring polypeptide and includes those polypeptides that differ from 
naturally occurring forms by having amino acid deletions, substitutions, and 
additions. 

30 

Thus, the present invention encompasses the use of variants, homologues and 
derivatives of nucleotide and amino acid sequences. Here, the term "homologue" 
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means an entity having a certain homology with amino acid sequences or nucleotide 
sequences. Here, the term "homology" can be equated with "identity". 

In the present context, an homologous sequence is taken to include an amino acid 
5 sequence which may be at least 75, 85 or 90% identical, preferably at least 95 or 
98% identical to the subject sequence. Although homology can also be considered 
in terms of similarity (i.e. amino acid residues having similar chemical 
properties/functions), it is preferred to express homology in terms of sequence 
identity. 

10 

An homologous sequence is taken to include a nucleotide sequence which may be at 
least 75, 85 or 90% identical, preferably at least 95 or 98% identical to the subject 
sequence. 

15 Homology comparisons can be conducted by eye, or more usually, with the aid of 
readily available sequence comparison programs. These commercially available 
computer programs can calculate % homology between two or more sequences. 

% homology may be calculated over contiguous sequences, i.e. one sequence is 
20 aligned with the other sequence and each amino acid in one sequence is directly 
compared with the corresponding amino acid in the other sequence, one residue at a 
time. This is called an "ungapped" alignment. Typically, such ungapped alignments 
are performed only over a relatively short number of residues. 

25 Although this is a very simple and consistent method, it fails to take into 
consideration that, for example, in an otherwise identical pair of sequences, one 
insertion or deletion will cause the following amino acid residues to be put out of 
alignment, thus potentially resulting in a large reduction in % homology when a 
global alignment is performed. Consequently, most sequence comparison methods 

30 are designed to produce optimal alignments that take into consideration possible 
insertions and deletions without penalising unduly the overall homology score. This 
is achieved by inserting "gaps" in the sequence alignment to try to maximise local 
homology. 
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However, these more complex methods assign "gap penalties" to each gap that 
occurs in the alignment so that, for the same number of identical amino acids, a 
sequence alignment with as few gaps as possible - reflecting higher relatedness 

5 between the two compared sequences - will achieve a higher score than one with 
many gaps. "Affine gap costs" are typically used that charge a relatively high cost 
for the existence of a gap and a smaller penalty for each subsequent residue in the 
gap. This is the most commonly used gap scoring system. High gap penalties will 
of course produce optimised alignments with fewer gaps. Most alignment programs 

10 allow the gap penalties to be modified. However, it is preferred to use the default 
values when using such software for sequence comparisons. For example when 
using the GCG Wisconsin Bestfit. package the default gap penalty for amino acid 
sequences is -12 for a gap and -4 for each extension. 

15 Calculation of maximum % homology therefore firstly requires the production of an 
optimal alignment, taking into consideration gap penalties. A suitable computer 
program for carrying out such an alignment is the GCG Wisconsin Bestfit package 
(University of Wisconsin, U.S.A.; Devereux et aL 9 1984, Nucleic Acids Research 
12:387). Examples of other software than can perform sequence comparisons 

20 include, but are not limited to, the BLAST package (see Ausubel et aL 9 1999 ibid - 
Chapter 18), FASTA (Atschul et aL, 1990, J. Mol. Biol., 403-410) and the 
GENEWORKS suite of comparison tools. Both BLAST and FASTA are available 
for offline and online searching (see Ausubel et aL, 1999 ibid, pages 7-58 to 7-60). 
However, for some applications, it is preferred to use the GCG Bestfit program. A 

25 new tool, called BLAST 2 Sequences is also available for comparing protein and 
nucleotide sequence (see FEMS Microbiol Lett 1999 174(2): 247-50; FEMS 
Microbiol Lett 1999 177(1): 187-8). 

. Although the final % homology can be measured in terms of identity, the alignment 
30 process itself is typically not based on an all-or-nothing pair comparison. Instead, a 
scaled similarity score matrix is generally used that assigns scores to each pairwise 
comparison based on chemical similarity or evolutionary distance. An example of 
such a matrix commonly used is the BLOSUM62 matrix - the default matrix for the 
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BLAST suite of programs. GCG Wisconsin programs generally use either the public 
default values or a custom symbol comparison table if supplied (see user manual for 
further details). For some applications, it is preferred to use the public default values 
for the GCG package, or in the case of other software, the default matrix, such as 
5 BLOSUM62. 

Once the software has produced an optimal alignment, it is possible to calculate % 
homology, preferably % sequence identity. The software typically does this as part 
of the sequence comparison and generates a numerical result. 

10 

The sequences may also have deletions, insertions or substitutions of amino acid 
residues which produce a silent change and result in a functionally equivalent 
substance. Deliberate amino acid substitutions may be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the 

15 amphipathic nature of the residues as long as the secondary binding activity of the 
substance is retained. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; 
and amino acids with uncharged polar head groups having similar hydrophilicity 
values include leucine, isoleucine, valine, glycine, alanine, asparagine, glutamine, 

20 serine, threonine, phenylalanine, and tyrosine. 

Conservative substitutions may be made, for example according to . the Table below. 
Amino acids in the same block in the second column and preferably in the same line 
in the third column may be substituted for each other: 



ALIPHATIC 


Non-polar 


GAP 






ILV 




Polar - uncharged 


CSTM 






NQ 




Polar - charged 


DE 






KR 


AROMATIC 




HFWY 

■ 
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Homologous substitution (substitution and replacement are both used herein to mean 
the interchange of an existing amino acid residue, with an alternative residue) may 
occur i.e. like-for-like substitution such as basic for basic, acidic for acidic, polar for 
polar etc. Non-homologous substitution may also occur i.e. from one class of 
5 residue to another or alternatively involving the inclusion of unnatural amino acids 
such as ornithine (hereinafter referred to as Z), diaminobutyric acid ornithine 
(hereinafter referred to as B), norleucine ornithine (hereinafter referred to as O), 
pyriylalanine, thienylalanine, naphthylalanine and phenylglycine. 

10 Replacements may also be made by unnatural amino acids include; alpha* and 
alpha-disubstituted* amino acids, N-alkyl amino acids*, lactic acid*, halide 
derivatives of natural amino acids such as txifluorotyrosine*, p-Cl-phenylalanine*, 
p-Br-phenylalanine*, p-I-phenylalanine*, L-allyl-glycine*, B-alanine*, L-a-amino 
butyric acid*, L-y-amino butyric acid*, L-a-amino isobutyric acid*, L-e-amino 

15 caproic acid # , 7-amino heptanoic acid*, L-methionine sulfone**, L-norleucine*, L- 
norvaline*, p-nitro-L-phenylalanine*, L-hydroxyproline # , L-thioproline*, methyl 
derivatives of phenylalanine (Phe) such as 4-methyl-Phe*, pentamethyl-Phe*, L-Phe 
(4-amino) # , L-Tyr (methyl)*, L-Phe (4-isopropyl)*, L-Tic (1,2,3,4- 
tetrahydroisoquinoline-3-carboxyl acid)*, L-diaminopropionic acid # and L-Phe (4- 

20 benzyl)*. The notation * has been utilised for the purpose of the discussion above 

( relating—to - homolog ous — Q3^_nnn^inTrin1ngmis substitution), to indicate the 

hydrophobic nature of the derivative whereas # has been utilised to indicate the 
hydrophilic nature of the derivative, #* indicates amphipathic characteristics. 

25 Variant amino acid sequences may include suitable spacer groups that may be 
inserted between any two amino acid residues of the sequence including alkyl 
groups such as methyl, ethyl or propyl groups in addition to amino acid spacers such 
as glycine or (5-alanine residues. 

30 USES 

It will be appreciated that the amplified RNA produced in accordance with the 
present invention represents a useful intermediate for a wide variety of downstream 
applications. 
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cDNA libraries 

The amplified RNA may facilitate the construction of complex cDNA libraries from 
extremely limited amounts of tissue. 

5 

The amplified RNA can be easily converted into double stranded cDNA using 
various methods that are well known in the art. 

Optionally the double stranded cDNA that is generated may be inserted into a 
10 vector. This allows the recombinant DNA molecules comprising the cDNA library 
to be introduced into host cells - such as eukaryotic and prokaryotic hosts. 

Ribonucleotide probes 

The amplified RNA may also be used for the production of specific ribonucleotide 
15 probes without prior cDNA cloning into riboprobe vectors. 

Subtractive hybridisation 

Furthermore, the amplified RNA provides a source of large amounts of single- 
stranded, anti-sense material for use as driver in subtractive hybridization. For 
20 example, two nucleic acid populations, one sense, and one anti-sense, may be 
allowed to mix together with one population present in molar excess (driver). 
Sequences present in both populations will form hybrids, whereas sequences present 
in only one population remain single-stranded. Thereafter, various well known 
techniques are used to separate the unhybridised molecules representing 

25 differentially expressed sequences. Accordingly, the amplified RNA may also be 
applied to improve methods of detecting and isolating nucleic acid sequences that 
vary in abundance among different populations, such as in comparing mRNA 
expression among different tissues or within the . same tissue according to 
physiologic state.. Examples of subtractive hybridisation technologies include 

30 Suppression Subtractive Hybridisation technology (US 5,565,340), representation 
difference analysis (US 5,436,142); and linker capture subtraction (Anal. Biochem. 
(1996) 237:109-114). 
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Anti-sense RNA 

Anti-sense RNA has a wide variety of uses in both analytical research and 
therapeutics. Anti-sense RNA functions in several prokaryotic systems to regulate 
gene expression. Similarly, anti-sense RNA can regulate the expression of many 
5 eukaryotic genes. This permits blocking expression of undesirable genes. 
Therapeutic use of anti-sense RNA therefore involves in vitro synthesis of anti-sense 
RNA with subsequent introduction into the subject (see, generally, Melton, 
Antisense RNA and DNA, Cold Spring Harbor (1988)). 

10 Arrays 

The application of array technology is often limited because substantial amounts of 
RNA are required for target preparation. The present invention is therefore 
particularly suited to the generation of targets for array - such as microarray 
analysis, in particular, Affymetrix arrays. 

15 

Array technology and the various techniques and applications associated with it are 
described generally in numerous textbooks and documents. These include Lemieux 
et ah, (1998), Molecular Breeding 4, 277-289, Schena and Davis. Parallel Analysis 
with Biological Chips, in PCR Methods Manual (eds. M. Innis, D. Gelfand, J. 

20 Sninsky), Schena and Davis, (1999), Genes, Genomes and Chips. In DNA 
Microarrays: A Practical Approach (ed. M. Schena), Oxford University Press, 
Oxford, UK, 1999), The Chipping Forecast {Nature Genetics special issue; January 
1999 Supplement), Mark Schena (Ed.), Microarray Biochip Technology, (Eaton 
Publishing Company), Cortes, 2000, The Scientist 14[17]:25, Gwynne and Page, 

25 Microarray analysis: the next revolution in molecular biology, Science, 1999 
August 6; and Eakins and Chu, 1999, Trends in Biotechnology, 17, 217-218. 

Detection 

In a further aspect, the methods of the present invention may be used to identify one 
30 or more sequences in a sample by detecting the amplified sequences in the amplified 
RNA. 
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Differential amplification 

In a further aspect, the present invention may be used for the detection of 
differentially expressed genes. It is therefore useful for determining the relative 
levels of a given sequence relative to other sequences. 
5 

Such methods may be particularly useful in, for example, molecular diagnostics, 
where diagnosis is not based upon the presence or absence of a sequence, but on the 
relative levels of a given sequence. 

10 KITS 

The materials for use in the methods of the present invention are ideally suited for 
preparation of kits. 

Such a kit may comprise containers, each with one or more of the various reagents 
15 (typically in concentrated form) utilised in the methods, including, for example, 
buffers, the appropriate nucleotide triphosphates (e.g., dATP, dCTP, dGTP and 
dTTP; or rATP, rCTP, rGTP and UTP), reverse transcriptase, DNA polymerase, 
RNA polymerase, and one or more oligonucleotides of the present invention. 

20 Oligonucleotides in containers can be in any form, e.g., lyophilized, or in solution 
(e.g., a distilled water or buffered solution), etc. Oligonucleotides ready for use in 
the same amplification reaction can be combined in a single container or can be in 
separate containers. 

25 The kit optionally further comprises in a separate container an RNA polymerase 
specific to the RNA polymerase promoter, and/or a buffer for PGR, and/or a DNA 
polymerase. 

The kit optionally further comprises a control nucleic acid. 

30 

A set of instructions will also typically be included. 
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GENERAL RECOMBINANT DN A METHODOLOGY TECHNIQUES 
The present invention employs, unless otherwise indicated, conventional techniques 
of chemistry, molecular biology, microbiology, recombinant DNA and immunology, 
which are within the capabilities of a person of ordinary skill in the art. Such 
5 techniques are explained in the literature. See, for example, J. Sambrook, E. F. 
Fritsch, and T. Maniatis, 1989, Molecular Cloning: A Laboratory Manual, Second 
Edition, Books 1-3, Cold Spring Harbor Laboratory Press; Ausubel, F. M. et al. 
(1995 and periodic supplements; Current Protocols in Molecular Biology, ch. 9, 13, 
and 16, John Wiley & Sons, New York, N.Y.); B. Roe, J. Crabtree, and A. Kahn, 
10 1996, DNA Isolation and Sequencing: Essential Techniques, John Wiley & Sons; 
M. J. Gait (Editor), 1984, Oligonucleotide Synthesis: A Practical Approach, Irl 
Press; and, D. M. J. Lilley and J. E. Dahlberg, 1992, Methods of Enzymology: DNA 
Structure Part A: Synthesis and Physical Analysis of DNA Methods in Enzymology, 
Academic Press. Each of these general texts is herein incorporated by reference. 

15 

The invention will now be further described by way of Examples, which are meant 
to serve to assist one of ordinary skill in the art in carrying out the invention and are 
not intended in any way to limit the scope of the invention. 

20 EXAMPLES 

Example 1 \ 

Materials and Methods 

BD Biosciences Clontech's SMART™ technology allows PCR amplification of 1st 
strand DNA by incorporating a priming site at the 5* and 3' ends via the template 
25 switching mechanism. The primer (SEQ ID No. 1): 

5'AAGCAGTGGTATCAACGCAGAGTggccagtgaattgtaatacgactcactatagggaggcg 
g(T)3oVN-3' 

30 a 94-mer, is used to prime cDNA synthesis. The upper case region at the 5' end, is 
identical to the 5 f PCR Primer II A provided in BD Biosciences Clontech's 
SMART™ PCR cDNA Synthesis Kit and this sequence generates the 3' anchor on 
the cDNA for subsequent PCR amplification.. The lower case region is identical to 
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the T7 promoter sequence currently used in the Affymetrix cDNA synthesis primer. 
The T7 promoter sequence is added to allow the generation of labelled cRNA targets 
by in vifro transcription. The (T)30 region will bind to poly A tail of messenger 
RNAs and the 3'-terminal VN clamp (where V is A, G , or C and N is any base) 
5 helps ensure priming of mRNA. This oligonucleotide was purified by 
polyacrylamide gel electrophoresis before use. 

"SMART™ H A" Oligonucleotide (10 uM, BD Biosciences Clontech) (SEQ ID No. 

2): 

10 

S'-AAGCAGTGGTATCAACGCAGAGTACGCGGG-S 1 

and "5' PCR Primer II A" (10 uM, BD Biosciences Clontech) (SEQ ID No. 3): 

15 5 f -AAGCAGTGGTATCAACGCAGAGT-3 , 

are identical sequence to those in BD Biosciences Clontech's SMART™ PCR cDNA 
Synthesis Kit. 

20 All reactions are performed in 0,2ml thin-walled PCR tubes. 
RNA 

The RNA used in these studies was prepared using guanidinium thiocyanate method 
25 [Chirgwin et al, (1987)], with all the usual precautions for handling RNA. 
Cytoplasmic RNA or PolyA+ RNA could also be used in this technique. Integrity of 
total RNA should be checked by analysis of a sample on RNA gel electrophoresis 
(eg formaldehyde/ agarose gel), or Agilent LabChip. For mammalian total RNA, 
two bands at approximately 4.5 and 1.9 kb should be visible; these bands represent 
30 28S and 18S ribosomal RNA respectively, and the ratio of intensities of these bands 
should be 1.5-2.5:1. RNA purification kits for microscale RNA preparation are 
available from a number of commercial suppliers (for example Absolutely RNA™ 
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Nanoprep, Stratagene; PicoPure™, Arcturus; RNeasy®, Qiagen; RNAqueous™ 
Microkit, Ambion). 

Total RNAs for the studies reported here were isolated from two human bladder 
5 tumour biopsies that were obtained from an external collaboration between 
AstraZeneca and Institut Curie, Paris and Henri Mondor Hopital, Creteil, Paris. 
These bladder transitional cell carcinoma biopsies were from a T3 grade 3 (sample 
384) and a superficial Ta grade 1 (sample 842). Bladder biopsies were surgically 
removed by transurethral resection by Dominique Chopin at Henri Mondor Hopital, 
10 Creteil, Paris. Samples were immediately snap-frozen in liquid nitrogen and then 
stored at -80oC until RNA extraction. Total RNA was purified by guanidinium 
thiocyanate extraction followed by cesium chloride gradient ultracentrifugation 
(Chirgwin et al, (1987)) at Institut Curie, Paris, quantitated by A260 measurement 
and the integrity checked by RNA gel electrophoresis. Total RNAs were treated with 
15 RNase-free DNasel (Ambion, DNA-free™ kit) according the manufacturers 
instructions such that final concentrations were lug/ul. . As a reference, the targets 
were also prepared following the standard protocol (Affymetrix GeneChip® 
Expression Analysis Technical Manual) starting with 9ug of the same total RNAs. 

20 DNase I-treated total RNAs were then diluted with RNase-free water, and 5ng or 
50ng of total RNA were then used in each amplification reaction. 



4 RNA control sense polyA+ spikes were added to the total RNAs using a (20x) 
stock consisting of Lys, Phe and Thr from B.subtilis (ATCC) and kanamycin 
25 positive control polyA+RNA (Promega Corporation, Cat C1381). Plasmids for Lys, 
Phe and Thr were obtained from ATCC [pGIBS-Lys, ATCC 87482; pGIBS-Phe, 
ATCC 87483; and pGDBS-Thr ATCC 87484] and sense RNA transcripts were 
generated and purified as described in Affymetrix GeneChip® Expression Analysis 
Technical Manual [http://www.affymetrbc.com/], 

30 

Lys, Phe, Thr and kanamycin spikes were added at final concentrations of 1, 5, 20 
and lpM respectively to the non-amplified RNA samples. 



to 
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Table (1) gives a summary of the probes synthesised and their respective cRNA 
yields. Note that for the conventional Affymetrix standard protocol, RNA / 
T7(dT)24 annealing was undertaken in the absence of dNTPs, which were 
subsequently added to the first strand master mix. 

5 

SMARf M 1st strand cDNA syntheses 

The first and the second strand synthesis are performed in the same tube that 
enhances the synthesis procedure and maximizes recovery of cDNA. The method 
10 described here uses PowerScript™ Reverse Transcriptase (BD Biosciences 
Clontech), a point mutant of Moloney murine leukemia virus (MMLV) reverse 
transcriptase. PowerScript™ lacks RNase H activity, but retains wild-type 
polymerase activity, so longer cDNA fragments can be synthesized than wild-type 
MMLV RT. 

15 

The recommended SMART™ protocol (BD Biosciences Clontech) was modified by 
including 1 mM dNTP's in the initial step in the procedure where the RNA is heated 
to 70°C for 2 min. We show here that this modification increased the yield of cRNA, 
from defined amounts of starting Total RNA (see Table 2 below). This modification 
20 allows a reduced number of PCR cycles with limited starting material, and should 
help minimize any distortions to the mRNA distribution. 

cRNAs synthesised from 50ng or 5ng total RNA. ImM dNTPs were added at the 
RNA-primer annealing step plus the cDNA synthesis mix [dNTPs in steps 1 and 2], 

25 or only in the cDNA synthesis mix [dNTPs in step 2 only]. The numbers of PCR 
cycles required to subsequently generate ~ 20pg of cRNA are shown in Table 2. 
Primer mix was prepared by mixing equal volumes of lpM SMART™ -T7- 
oligo(dT)30VN oligonucleotide SEQ ID 1 and lyiM SMART™ II A oligonucleotide 
SEQ ID 2 (BD Biosciences Clontech) so that both were O.SpM. We have used a 

30 . reduced concentration of both primers [Baugh, et al (2001)] reported that reduced 
concentrations of oligo dT primer reduced non-specific artifacts when using lower 
amounts of template RNA, and we reasoned that reduced concentrations of the novel 
primer SMART™-T7-oligo(dT)30VN oligonucleotide SEQ ID 1 and of SMART™ 
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il A oligonucleotide SEQ ID 2 would be required as we are starting with much less 
total RNA than required in the standard SMART™ protocol. 

3\x\ of Total RNA were mixed with 2|il of 0.5pM primer mix.. RNA / primer was 
5 heated at 70°C for 2 min, then at 4°C for 5 min in a thermal cycler. 

At 4°C, 5]il of First Strand Masteimix was added to each reaction. First Strand 
Mastermix was prepared from components of SMART™ PCR cDNA Synthesis Kit 
(BD Biosciences Clontech, # Kl 052-1) and 2ul 5x First-strand buffer (250mM Tris- 
10 HC1, pH8.3, 30mM magnesium chloride and 375mM potassium chloride), lul 
20mM dithiothreitol (DTT), lul lOmM dNTPs (lOmM each dATP, dCTP, dGTP, 
dTTP), lul PowerScript™ Reverse Transcriptase was added. Reactions were mixed 
by gentle pipetting, then heated for lh at 42°C, then 4°C for 5 min in a thermal 
cycler. 

15 

PCR Amplification of SMART™ synthesised cDNA 



The method described here uses Advantage® 2 Polymerase Mix (BD Biosciences 
Clontech), which allows efficient and accurate amplification of cDNA templates by 
20 long-distance PCR [Barnes, 1994]. AdvantageTM 2 products are described in US 
5,436,149. 



The Advantage® 2 Polymerase Mix contains TITANIUM™ Taq DNA Polymerase, 
a nuclease-deficient N-tenninal deletion of Taq DNA Polymerase, and a minor 
25 amount of a proofreading polymerase. Advantage® 2 Polymerase Mix also contains 
TaqStart™ Antibody (BD Biosciences Clontech) to provide automatic hot-start PCR 
(Kellogg et aL> 1994) and reduce non-specific priming of template. This 
combination allows efficient amplification of full-length cDNAs with a significantly 
lower error rate than that of conventional PCR (Barnes, 1994). 

30 

90]il PCR Master Mix was added to each cDNA synthesis using the same tube as for 
SMART™ cDNA synthesis. 5' PCR IIA primers SEQ ID 3 was identical sequence 
to that used in the BD Biosciences Clontech SMART™ cDNAsynthesis kit (Cat # 
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Kl 052-1). This primer will bind to both ends of the SMART™ cDNA and allow 
PCR amplification. PCR was set up using components form Advantage® 2 
polymerase Kit . (BE) Biosciences Clontech, Cat# 8430-1) with 10ul 10X 
Advantage® 2 PCR Buffer (40mM Tricine-KOH, pH9.2, 15mM potassium acetate, 
5 3,5 mM magnesium acetate), 2ul 5 OX dNTP Mix (10 mM of each dNTP), 2ul lOuM 
5' PCR Primer H A SEQ ID 3, and 2ul SOX Advantage® 2 Polymerase Mix and 74ul 
water. 

Thermal cycling parameters of the PCR reactions were 60 sec at 95°C for hot start, 
10 followed by 3 step cycling for a pre-determined numbers of cycles (eg 1 1 cycles for 
50ng starting total RNA or 15 cycles for 5ng total RNA) of denaturation for 15 sec 
at 95°C, annealing for 30 sec at 65°C, and extension for 6 min at 68°C. Reactions 
were then held at 4°C in the thermal cycler until purification. 50ng reactions were 
typically cycled for 10 or 1 1 cycles, and 5ng reactions for 14 or 15 cycles. 

15 

We have devised an improved method for determining the optimal number of PCR 
cycles that are required from a given amount of starting material. For any PCR- 
based method one of the key factors is to perform the minimum numbers of PCR 
cycles (for a given amount of starting material) that will result in sufficient target for 

20 the intended downstream application. If too many cycles are performed, then it is 
likely that the representation of mRNAs could be biased as the PCR reactions are 
unlikely to be identical for each RNA template in the complex mixture, and PCR 
may also reach a plateau. If too few cycles are performed then insufficient PCR 
product is obtained for subsequent applications. BD Biosciences Clontech 

25 SMART™ methodology recommends an optimisation strategy that involves 
examination of the amplified PCR products generated after different numbers of 
cycles on agarose gel electrophoresis and then visual estimation of the optimal 
amplification. In the method described here, we determine empirically for a given 
amount of starting total RNA (eg 50ng or 5ng), the minimum numbers of PCR 

30 cycles that are required such that when the entire PCR products are then used for in- 
vitro transcription (IVT) reactions to generate cRNA, we obtain sufficient cRNA for 
the intended downstream application. This can be done easily by starting with a 
given amount of total RNA (eg 5ng or 50ng) by setting up identical reactions and 
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performing PCR for defined numbers of cycles (or by removing aliquots form a PCR 
reaction for analysis and performing additional cycles on the remainder of the PCR 
reaction). The entire PCR products from each reaction (for example 50ng starting 
total RNA that has undergone PCR thermal cycling for 9,10,11,12,13 or 14 cycles) 
5 is purified and used for IVT. The number of PCR cycles that gives the minimum 
amount of cRNA that is sufficient for the downstream application is then 
determined. This number of cycles can then be used routinely for similar studies 
with other RNA's at the same starting concentrations. 

10 For example, in this study reported here we describe downstream use on Affymetrix 
GeneChip® probe arrays. If the PCR product was to be used for another downstream 
application then an identical strategy could be used. The cycling parameters in this 
protocol have been optimised using a MJ Research PTC200 thermal cycler. 
However, the numbers of cycles required for different amounts of starting total RNA 

15 should be determined empirically in initial experiments as optimal parameters may 
vary with different templates, thermal cyclers, or if different enzymes are used for 
cDNA synthesis or PCR. Once the optimal conditions are determined for a specific 
amount of starting total RNA, then subsequent experiments with this amount of 
RNA under identical conditions should results in similar yields of product. 

20 

This PCR step can also be enhanced by addition of dNTP's in the initial step in the 
procedure where the RNA is heated to 70oC for 2 minutes (see above for details). 
This means that a reduced number of PCR cycles are required with limited amounts 
of starting material and therefore help mimimise any distortions to the mRNA 
25 distributions. 

PCR products were purified using QIAGEN Qiaquick columns as per manufacturers 
instructions. For the elution: 30pl of elution. buffer was applied to column and left to 
stand for 2min prior to centrifugation. Eluted volumes were adjusted to 30}il with 
30 molecular biology grade water. PCR purification could be performed using other 
PCR purification methods or commercial kits. 



Synthesis of Biotin-Labelled cRNA (IVT) from amplified cDNAs 
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ENZO ® BioArray™ HighYield™ RNA Transcript Labeling Kit, Affymetrix 
900182 was used for IVT, but this could be obtained from a number of other 
suppliers. To each 30pl of purified PCR was added 20pl of Mastermix. [5pl lOx HY 
5 Reaction Buffer, 5pl lOx Biotin Labelled Ribonucleotides, 4pl lOx DDT, 4pl lOx 
RNase Inhibitor Mix, 2pl 20x T7 RNA Polymerase]. IVT reactions were incubated 
at 37°C for 5hrs using Uun-walled 0.2ml PCR tubes. 

50ul water was added per sample and then the entire reactions were purified using 
10 RNeasy® Mini columns (QIAGEN) as per manufacturer's instructions. Purified 
RNA was eluted with 2 sequential applications of 50pl RNase-firee water i.e. lOOpl 
final volume. 

5ul eluted RNA was added to 95ul water per well of a 96-well UV plate (Costar # 
15 3635) and used for A260 measurement (Molecular Devices, Spectramax) to 
quantitate the amount of RNA. 

Fragmentation and Hybridization 

20 Followed the instructions described in the Affymetrix GeneChip® Expression 
Analysis Technical. Manual. 

Data Analysis 

25 Data were analysed for a number of assay performance criteria, in order to evaluate 
the effectiveness of the novel amplification protocol for Affymetrix GeneChip® 
array expression analysis. These criteria were labelled cRNA yield; Standard array 
quality metrics including Raw Q, Background, Scaling Factor, Percent Present Calls, 
and 3' and 5' Signal Intensity ratio of control probe sets; Linearity and sensitivity of 

30 amplification as quantified using spike-in bacterial poly- A controls; Reproducibility; 
Concordance analysis of differential gene expression between the standard 
Affymetrix protocol and the novel amplification protocol; Confirmation of gene 
expression changes by an independent technique (RT-PCR). 
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LabeliedcRNA Yield 

The yield of labelled cRNA is critical because sufficient target needs to be generated 
. 5 for downstream applications for example Affymetrix GeneChip® probe arrays. The 
required amount of labeled cRNA for Affymetrix GeneChip® is 10-15 |ig for each 
genome array. A series of experiments were carried out using different amounts of 
starting material. Total RNAs were extracted from a T3 Grade 3 (sample 384) and a 
superficial Ta Grade 1 (sample 842) transitional cell bladder carcinoma and used as 

10 template for preparing probes. Replicate probes were prepared from 9]ig total RNA 
using the conventional Affymetrix protocol and from 50ng and 5ng total RNA using 
the novel amplification method described here. Table 3 below gives a summary of 
the probes synthesised, the number of thermocycles used and their respective cRNA 
yields, and indicates which probes were used for subsequent fragmentation and 

15 hybridization to Affymetrix GeneChips. Note that for the conventional probe 
syntheses oligo dT annealing was undertaken without the presence of dNTPs. The 
quantity of cRNA obtained was measured by absorbance at 260 nanometers (nm) 
after purification and are plotted (Figure 2) to demonstrate the repeatability of 
amplification reactions. As shown in Figure 2, the quantities of the labeled cRNA 

20 obtained from samples of 5ng or 50 ng total RNA with the novel amplification 
protocol were comparable with the range anticipated from the standard protocol. 

Standard GeneChip® Array Quality Metrics 

25 To further evaluate whether the novel amplification protocol is suitable for preparing 
targets for GeneChip® array expression analysis, 10 pg of cRNA targets, generated 
from the. experiments described previously, were hybridized on the GeneChip® 
Human Genome U133A (HG-U133A) arrays under standard conditions and washed 
using the EukGE-WS2v4 fluidics protocol. After scanning, the chips were subjected 

30 to a visual QC check for excessive background and the presence of staining 
artefacts. The data were then analyzed using Affymetrix Microarray Suite 5.0 (MAS 
5.0) software and various quality control metrics were obtained 
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Table 3 gives a summary of the QC metrics taken from the .RPT files. The scaling 
factors were consistent for all chip hybridisations with no extreme values observed. 
Raw Q, Background, and Scaling Factor values were examined to evaluate the 
overall sample quality with the targets prepared according to the two protocols 
5 (novel amplification protocol and the standard protocol) (see Fig 3). As shown in 
Table 4, comparable values were obtained for all three parameters. For example, the 
Background values were all about 100, as anticipated for typical experiments. The 
Scaling Factors were also within threefold range — even when comparing the data 
from 5 ng of starting material amplified with the novel amplification protocol. 

10 

Affymetrix GeneChip® arrays are designed predominantly with probes selected 
adjacent to the poly-A tail of the mRNA. This design strategy along with the 
inherent generation of shorter fragment from additional amplifications may create 
targets that are skewed to the 3' end. To examine this phenomenon, Affymetrix have 

15 created probe sets for specific maintenance genes (e.g., GAPDH, actin), and these 
probe sets are designed to the 3% middle, and 5' regions of the transcript The 3' 
probe set Signal Intensity can then be compared to the 5' probe set Signal Intensity 
(375' ratio) to evaluate the efficiency of the transcription reaction. As shown in Fig 
4 (top and middle), the 5 ng and 50ng total RNA samples with the novel 

20 amplification protocol described here, produced the 375' ratio for GAPDH and 
Actin genes, of approximately 1, which are equivalent to those samples processed 
with the standard protocol and well within the Affymetrix recommended range of 3. 
Actin transcript represented on the array was longer than the GAPDH gene, with 
1,761 bases (with the 5' probe set within 1 178-1712 bases, and the Middle probe set 

25 within 589-1117 bases), but the 375' ratios were still maintained. We also 
calculated the 3 '/Middle probe set ratio (37M) of the GAPDH and Actin, genes 
(data not shown) which was also very similar to those samples processed with the 
standard protocol. 

30 The Percent Present Calls comparison was used to globally assess the data 
representations. As shown in Fig 4 (bottom), comparable Percent Present Call values 
were obtained with reducing amounts of starting materials. Even at 5 ng of starting 
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total RNA, about 50 percent of the probe sets were still called as Present by the 
Affymetrix MAS5.0 software algorithm. 

Linearity and Sensitivity 

5 

The ability of any amplification protocol to accurately detect differences in 
expression levels is highly dependent on the assay's linearity and sensitivity. 

The novel amplification protocol was evaluated for both parameters by analyzing the 
10 spike-in poly-A control transcripts in a complex sample. 4 spike-in control 
transcripts were spiked into the complex human bladder biopsy total RNA samples 
at various concentrations.. 3 were in Wflro-generated bacterial poly-A controls - lys, 
phe, and thx and the other spike was a commercially available poly-A control RNA 
for kanamycin. Lys, phe, and thx are represented by probe sets on the U133A 
15 GeneChips and so these spikes can be detected when added to RNA samples for 
profiling. Kanamycin spike-in poly-A control transcript can be measured by a 
specific real-time PCR based TaqMan assay. 5 ng and 50 ng of total RNA were 
labelled with the novel amplification protocol, the target was hybridized on HG- 
U133A arrays, and the Signal Intensities for the controls were plotted (Figure 5), and 
20 the three transcripts were detected by all the probes regardless of the method. 



Reproducibility of replicates 

Reproducibility is a key requirement for any amplification protocol, and it is 
25 essential for generating reliable results. Two independent target preparations with 5 
ng and 50 ng of two different total RNAs (RNA IDs 384 and 842) using the novel 
amplification protocol were hybridized to HG-U133A arrays. 

Pearson correlation values (Tables 5 and 6) were calculated for the amplified and 
30 non-amplified probes derived from each sample. The Pearson correlation 
coefficients were all >0.95 thus indicating very good agreement between each set of 
replicates. However, as the standard and novel amplification protocol use different 
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total RNA amounts and different protocols, it is not recommended that results 
obtained from amplified and non-amplified samples are directly compared. 

Data sets for each chip hybridisation were clustered using GeneSpring software 
5 (Redwood City, CA): the clustering dendrogram (Figure 6) shows that the two 
samples were clustered separately and that, as expected, within each sample the 
amplified probes were distinguishable from the non-amplified probes 

Scatter plots of the replicates 

10 

Log scale scatter plots for replicate samples (Figure 7) indicate typical profiles about 
the xy diagonal which was indicative of good reproducibility for the replicates. Most 
of the scatter was noticeable at signal values <500; this is due to mainly to the 
limitations of the technology platform and is independent of probe synthesis method. 

15 

Data analysis 

The numbers of genes changing by more than 2 fold in any direction were identified 
using Spotfire software (Goteborg, Sweden) and summarised in Tables 7 and 8. Log 
20 scale scatter plots in Figure 8 indicate genes changing >2 fold (equivalent to > |logi 0 
0.3|). 

We have compared scatter plots of all the log ratio sets of all the pairwise 
comparisons of non-amplified versus amplified samples to obtain a measure of 

25 relative gene expression for the samples being compared. Figure 9 shows that for all 
the log ratio sets, when compared to Set A, they are positioned similarly along the 
xy diagonal. As a qualitative observation, it is reasonable to assume that the ratios 
sets are all alike - indeed the ratios derived from the novel amplification method are 
no more different than the Set A versus Set B comparison where all data was 

30 obtained using standard protocol. The Pearson correlation coefficients vary from 
0.81 to 0.87 with the highest figure reported for the comparison between non- 
amplified probes and the lowest figure of 0.81 reported for Set A versus Set F. - 
these are all good correlation values for log ratio comparisons. Note that the data 
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presented in Figure 9 were filtered to remove noise i.e. for any given gene where 
signals were <100 across all twelve GeneChip® hybridisations. Table 4 shows that 
the average background signal reported for this experiment was 1 10 ± 22 SD. 



5 Table 6 & Figure 8 show that there were similar numbers of genes changing >2 fold 
for each comparison between samples 384 and 842 (average = 7844 ± 444 SD) 
regardless of the probe synthesis method. In order to determine how many of these 
changes are for the same genes in each set, Set A was used as the reference because 
these data were obtained using standard Affymetrix protocol. However, by taking 

10 the comparison of Set A versus all other sets (Table 6) it can be seen that the number 
of genes in common is less than the figure of 7844 reported above, and that this 
includes Set A versus Set B where all data were obtained using standard Affymetrix 
protocol. Furthermore, it can be seen that there were only 2385 genes in common 
across all five comparisons to Set A. This result is at least in part due to the 

15 limitation of setting an arbitrary cut off value (in this case >2 fold). 

k-means clustering was employed on the data sets using Spotfire software with 
default settings and arbitrarily chosen to calculate 70 clusters. The data sets were left 
intact with no background signal filtering employed. The frequency histograms 
20 (Figure 10) depict the numbers of genes in each cluster and whether they were 
ranging >2 fold in yyhicfr case the y are coloured blue. Note that the largest cluster 
contains 5488 genes, and that the first four clusters comprise approximately 60% of 
all genes. Collectively the blue clusters in the lower histogram represent the 2385 
genes changing >2 fold in common - see Table 7. 

25 

Attention will now be paid to selected clusters so as to illustrate that the gene 
transcript profiles can be accurately captured using the CPA method. Firstly, some 
pointers to help with analysis of the clusters presented in Figure 12 (all genes), 
Figure 13 (genes <2 fold) and Figure 14 (genes >2 fold):- 

30 

• clusters are numbered consecutively, left to right, proceeding 
downwards, from 1 to 70 (the cluster title bar should just about be 
legible when printed about) 
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y axis of each cluster is logio scale 

x axis is represented in this order, left to right: 384 5ng repl, 384 50ng 
rep2, 384 5ng repl, 384 5ng rep2, 384 9pg repl, 384 9pg rep2, 842 
5ng repl, 842 50ng rep2, 842 5ng repl, 842 5ng rep2, 842 9pg repl, 
842 9pg rep2 



10 



data points 1-6 should be similar horizontally or data points 7-12 
should be similar horizontally 



As an example, regarding the intentionally made sparse clusters in Figure 1 1 we can 
say that:- 



15 O 


cluster 1 


O 


cluster 2 


O 


cluster 3 


O 


cluster 4 


O 


cluster 5 


20 O 


cluster 6 


O 


cluster 7 


o 


cluster 8 


o 


cluster 9 



signals show no pattern and are low in magnitude i.e. <100 

one gene down-regulated in 842, other gene shows no alteration 

gene shows no meaningful alteration between samples 

one gene up-regulated in 842, other gene shows no alteration 

three genes down-regulated in 842 

no discernible pattern 

no change in gene expression 

as for cluster 7 

three genes down-regulated in 384 



25 Looking at the clusters generally (Figure 12) it can be seen that there are some odd 
shaped ones including the "W" shapes in clusters 27, 48, and 62. However the log 
ratios reported for these genes, which are often ribosomal in nature, do not indicate 
any differential gene expression — and from a biological perspective this is consistent 
because the ribosomal species are not expected to vary significantly anyway, and 

30 this is likely to be artefact of the novel amplification protocol whereby some 
amplification of non-polyadenylated ribosomal RNAs is occurring but which in the 
authors' opinion can be considered to have an entirely neutral impact on the data: it 
is the gene expression ratios that are important. Moreover we can take into 
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consideration the fact that the total numbers of these genes are very small, and that 
they are nearly all coloured red i.e. changing <2 fold - it is left to the reader to refer 
to the histograms in Figure 10 to confirm these facts. As stated previously the first 
four clusters comprise about 60% of the genes on the GeneChip® and in the case of 

5 cluster 1 all the genes demonstrate very low signal magnitudes (~<100). This is very 
close to the detection threshold of the Affymetrix technology and signals in this 
region may or may not be due to real gene expression regardless of the method of 
probe synthesis. Figure 1 1 shows clearly genes demonstrating meaningful changes 
in expression, and which behave similarly for the novel amplification protocol and 

10 Affymetrix standard protocol derived data, will be of the shapes in the example 
clusters numbered 5 and 9. Now by inspecting the main body of data by referral to 
Figure 13, which shows clusters containing genes changing >2 fold, it is obvious 
that that for the majority of data where differential expression occurs then the novel 
amplification protocol and Affymetrix standard protocols are very alike. It is this 

15 qualitative analysis, along with the scatter plots of log ratios and the QC metrics, 
which demonstrate that the novel amplification protocol can generate transcript 
profiles from 5-50ng total RNA with acceptable maintenance of profile integrity. 

Several gene changes have been confirmed by RT-PCR using the same Total RNAs 
20 (384 and 842) and are concordant with Affymetrix GeneChip® results (data not 
shown). 



Conclusions 

25 We found that the novel amplification protocol is suitable for robustly amplifying 
and labelling as low as 5 ng of total RNA for expression profiling. The assay 
demonstrated good cRNA yield, sensitivity, and reproducibility. In the work 
exemplified here we have focused downstream application of the cRNA, that is 
generated via the novel amplification method, for Affymetrix GeneChip® probe 

30 arrays. However, the cRNA could be used for other downstream applications, 
including other gene expression profiling platforms. The results closely approximate 
the standard Affymetrix method thus maintaining the integrity of the transcript 
profile and the QC metrics are comparable to those obtained using standard protocol. 
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The protocol provides scope for further improvement in particular the number of 
thermocycles could be reduced further still so that the final yield of biotinylated 
cRNA is approximately lOpg since this is the minimum quantity actually required 
for hybridisation to an Asymetrix GeneChip®. 

5 

As described here, there is also scope to use lower amounts of total RNA. 



All publications mentioned in the above specification are herein incorporated by 
reference. Various modifications and variations of the described methods and 

10 system of the invention will be apparent to those skilled in the art without departing 
from the scope and spirit of the invention. Although the invention has been 
described in connection with specific preferred embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such 
specific embodiments. Indeed, various modifications of the described modes for 

15 carrying out the invention which are obvious to those skilled in molecular biology or 
related fields are intended to be within the scope of the following claims. 



20 
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Table 1 



Sample 


Addition of dNTPs during 
annealing 


Method 


384 50ng amp repl 


yes 


15 cycles PCR + IVT 


384 50ng amp rep2 


yes 


15 cycles PCR + IVT 


384 5ng amp repl 


yes 


11 cycles PCR + IVT 


384 5ngamprep2 


yes 


11 cycles PCR + IVT 


384 9|ig non-amp 
repl 


no 


iv i omy 


384 9pg non-amp 
rep2 


no 


T\7T nnlv 
• 1 V 1 vJJLLL V 


842 50ng amp repl 


yes 


15 cycles PCR + IVT 


842 50ng amp rep2 


yes 


15 cycles PCR + IVT 


842 Sngamp repl 


yes 


11 cycles PCR + IVT 


842 5ng amp rep2 


yes 


11 cycles PCR + IVT 


842 9pg non-amp 
repl 


no 


IVTonly 


842 9]ig non-amp 
rep2 


no 


IVTonly 



5 
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Table 2 

The effect of dNTP addition at the primer-annealing step on yield of cRNA 





dNTPsin 


dNTPsin 


Starting Total RNA 


step 2 only 


steps 1 and 






2 


50ng Total RNA 


12 


10 


5ng Total RNA 


16 


14 
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Probe syntheses cRNA yield and number of thermocycles used. 



No. 


sample 


PCR 
cycles 


IV 
T 


Yield cRNA 
(Mg) 




1 


50ng 842 repl 


10 


Ye 

s 


20 


Not used 


2 


50ng 842 rep2 


10 


cc 


21 


cc 


3 


50ng 842 repl 


11 


cc 


37 


Used for fragmentation + 
hyb 


4 


50ng 842 rep2 


11 


cc 


36 


cc 


5 


50ng 384 repl 


10 


ct 


21 


Not used 


6 


50ng 384 rep2 


10 


cc 


20 


cc 


7 


50ng 384 repl 


11 


(« 




Used for fragmentation + 
hyb 


8 


50ng 384 rep2 


11 


cc 


32 


cc 


9 


5ng 842 repl 


14 


cs 


18 


Not used 


10 


5ng 842 rep2 


14 


cc 


17 


cc 


1 1 
1 1 


^■nrr 0/10 t/^t-iI 

Dng o'tz repi 


1 *T 

1j 


e< 




Used for fragmentation + 
hyb 


12 


5ng 842 rep2 


15 


c< 


28 


cc 


13 


5ng 384 repl 


14 




21 


Not used 


14 


5ng 384 rep2 


14 


cc 


16 


cc 


15 


5ng 384 repl 


15 


cc 


26 


Used for fragmentation + 
hyb 


16 


5ng 384 rep2 


15 


cc 


28 


cc 


17 


842 9pg repl 


0 


cc 


80 


cc 


' 18 


842 9pgrep2 


0 


cc 


85 


cc 


19 


384 9pg repl 


0 


cc 


87 


cc 


•20 


384 9ug rep2 


0 


cc 


91 


cc 



5 
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Table4 

Array Quality Metrics Comparisons 



sample 


Raw Q 


*Backgroun 
d 


Scaling Factor 


gapdh 
375' 


actin375' 


384 50ng amp repl 


4.32 


115.14 


1.96 


0.80 


0.97 


384 50ng amp rep2 


4.06 


97.83 


2.43 


0.96 


0.78 


384 5ng amp repl 


4.10 


92.74 


2.66 


1.01 


0.84 


384 5ng amp rep2 


5.99 


172.10 


1.94 


1.03 


0.91 


384 9pg non-amp 
repl 


4.16 


99.42 


1.99 


0.86 


1.08 


384 9pg non-amp 
rep2 


5.00 


116.93 


2.00 


1.10 


1.11 


842 50ng amp repl 


4.41 


108.02 


2.35 


0.97 


0.92 


842 50ng amp rep2 


4.13 


98.75 


2.30 


0.96 


0.82 


842 5ng amp repl 


4.04 


98.89 


2.57 


0.88 


0.83 


842 5ng amp rep2 


3.94 


94.44 


2.30 


0.94 


0.86 


842 9ug non-amp 
repl 


4.26 


99.84 


2.75 


0.91 


0.96 


842 9pg non-amp 
rep2 


5.00 


129.86 


2.25 


0.92 


1.06 



5 * Average Background = 1 1 0 ± 22 SD 

The targets generated from the standard protocol or the novel amplification protocol were 
hybridised to U133A Human Genome Genechips under standard conditions and washed using 
the EukGE-WS2v4 fluidics protocol. Table 2 gives a summary of the QC metrics taken from 
10 the Affymetrix MAS5.0 .rpt files. All of the metrics are within the acceptable range of 
values. 
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Table 5 

Pearson correlation matrix for the biopsy "842" samples compared against each other. 



# 



Sample 842 


50ng amp 
rl 


50ng amp 
r2 


5ng amp 
rl 


5ng amp 9ug non- 9ug non- 
r2 amp rl amp r2 


50ng amp 
rl 


1.00 








50ng amp 
r2 


0.99 


1.00 






5ng amp rl 


0.99 


0.99 


1.00 




5ng amp r2 


0.99 


0.99 


0.99 


1.00 


9ug non- 
amp rl 


0.95 


0.95 


0.95 


0.95 1.00 


9ug non- 
amp r2 


0.96 


0.95 


0.95 


0.95 0.99 1.00 



Table 6 



Pearson correlation matrix for the biopsy "384" samples compared against each other. 



Sample 384 


50ng amp 
rl 


50ng amp 
r2 


5ng amp 
rl 


5ng amp 
r2 


9ug non- 9pg non- 
amp rl amp r2 


50ng amp rl 


1.00 










50ng amp r2 


0.98 


1.00 








5ng amp rl 


0.98 


0.99 


1.00 






5ng amp r2 


0.98 


0.99 


0.99 


1.00 




9ug non- 
amp rl 


0.96 


0.96 


0.96 


0.95 


1.00 


9ug non- 
amp r2 


0.96 


0.96 


0.96 


0.96 


0.99 1.00 



10 



V 
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Table 7 

Genes changing >2 fold for each set of replicates. 



5 



Set 


Comparison 


Genes changing > 2 fold 


A 


9ug 384 non amp repl versus 9ug 842 non amp repl 


7440 


B 


9pg 384 non amp rep2 versus 9ug 842 non amp rep2 


7477 


C 


50ng 384 amp repl versus 50ng 842 amp repl 


7620 


D 


50ng 384 amp rep2 versus 50ng 842 amp rep2 


7744 


E 


5ng 384 amp repl versus 5ng 842 amp repl 


8508 


F 


5ng 384 amp rep2 versus 5ng 842 amp rep2 


8275 



Table 8 



10 Intersection of sets demonstrating number of gene expression changes in common for each 

pairwise comparison 



Set comparisons > 2 fold common to both sets Common to all 5 comparisons 



AandB 


4737 




AandC 


4661 




AandD 


4738 


2385 


AandE 


4803 




AandF 


4678 





15 
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SEOUENCES 
SEP ID No 1 

5 5'AAGCAGTGGTATCAACGCAGAGTGGCCAGTGAATTGTAATACGACTCA 
CTATAGGGAGGCGG(T) 30 VN-3' 

where V is A, G , or C and N is any base 

10 

SEP ID No. 2 (described in US 5,962,271 and US 5,962,272) 
5-AAGCAGTGGTATCAACGCAGAGTACGCGGG-3' 
15 SEP ID No. 3 ( described in US 5,962,271 and US 5,962,272) 
5'-AAGCAGTGGTATCAACGCAGAGT-3' 
SEP ID No 4 

20 

5'AAGCAGTGGTATCAACGCAGAGTAATACGACTCACTATAGGGAGA(T)24 

VN-3' * — - — — — ~ " ~ ~ 

wherein V is A, G , or C and N is any base. 

25 

SEP ID No. 5 

5' GCATTAACCCTCACTAAC 3' 
30 SEP ID No. 6 

5' TAATACGACTCACTATA 3' 
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SEOIDNo.7 

5' AATACGACTCACTATAGGGAGA 3' 
SEP ID NO. 8 

5' GGCCAGTGAATTGTAATACGACTCACTATAGGGAGGCGG 3' 
SEP ID No. 9 

5' ATTTAGGTGACACTATA.3' 
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CLAIMS 



1 . A cDN A-RN A hybrid comprising a first strand cDNA synthesis substantially 
hybridised to RNA wherein the cDNA comprises an amplifier sequence and an RNA 

5 annealing region operably linked to an RNA polymerase promoter, and wherein at 
least one non-templated nucleotide at the 3' end of the first strand cDNA is 
hybridised to a template switching oligonucleotide. 

2. A cDNA-RNA hybrid according to claim 1 wherein the RNA is mRNA. 

10 

3. A cDNA-RNA hybrid according to claim 1 or claim 2 wherein the RNA 
polymerase promoter is a bacteriophage promoter. 

4. A cDNA-RNA hybrid according to claim 3 wherein the bacteriophage 
15 promoter is selected from the group consisting of T7, T3 and SP6. 

5. A cDNA-RNA hybrid according to claim any one of the preceding claims 
wherein the RNA annealing region comprises poly (dT). 

20 6. A cDNA-RNA hybrid according to claim 5 wherein the oligo(T) region is 
from about 10 to about 30 T residues in length. 



7. A cDNA-RNA hybrid according to any one of the preceding claims wherein 
the 3' end of the RNA annealing region comprises a VN clamp, wherein V is A, G 

25 orCandNisA, G, CorT. 

8. A cDNA-mRNA hybrid according to any one of the preceding claims 
wherein at least one non-templated nucleotide at the 3' end of the first strand cDNA 
synthesis is deoxycytidine. 



wherein at least three non-templated nucleotide at the 3 5 end of the first strand 
cDNA.synthesis are hybridised to a template switching oligonucleotide. 



30 



9. 



A cDNA-mRNA hybrid according to any one of the preceding claims 
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10. A cDNA-mRNA hybrid according to any one of the preceding claims 
wherein at least three of the non-templated nucleotides at the 3' end of the first 
. strand cDNA synthesis are deoxycytidine nucleotides. 

5 

11. A cDNA-mRNA hybrid according to any one of the preceding claims 
wherein the template switching oligonucleotide has at least three guanine residues at 
its 3' end. 

10 12. A cDNA-mRNA hybrid according to any one of the preceding claims 
wherein the amplifier sequence, the amplification primer and the template switching 
oligonucleotide contain the same sequence. 

13. A cDNA-mRNA hybrid according to any one of the preceding claims 
15 wherein the 3' end of the first strand cDNA synthesis is extended such that it is 

substantially complementary to the template switching oligonucleotide. 

14. A cDNA-mRNA hybrid according to any one of the preceding claims 
wherein the first strand cDNA synthesis is synthesised by a reverse transcriptase. 

20 

15. A cDNA-mRNA hybrid according to claim 14 wherein the reverse 
transcriptase lacks RNaseH activity but retains wild-type polymerase activity. 

16. A cDNA-mRNA hybrid according to claim 14 or claim 15 wherein the 
25 reverse transcriptase is a Moloney Murine Leukemia virus (MMLV) reverse 

transcriptase or a mutant thereof. 

17. A cDNA-mRNA hybrid according to any one of claims 14 to 16. wherein the 
reverse transcriptase is PowerScript™ Reverse Transcriptase (BD Biosciences 

30 Clontech). 

18. A method for amplifying RNA in a sample comprising the steps of: 
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(a) providing a cDNA synthesis oligonucleotide comprising an amplifier 
sequence and an RNA annealing region operably linked to an RNA polymerase 
promoter; 

5 (b) annealing the RNA annealing region of the cDN A synthesis oligonucleotide 
to RNA under suitable conditions to produce a cDNA-RNA complex; 

(c) incubating said cDNA-RNA complex under conditions which permit 
template-dependent extension of the cDNA synthesis oligonucleotide to generate an 

10 cDNA-RN A hybrid; 

(d) contacting said cDNA-mRNA hybrid with a template switching 
oligonucleotide under conditions which permit template dependent extension of said 
cDNA, such that the 3' end of the cDNA comprises a sequence complementary to 

15 said template switching oligonucleotide; 

(e) providing an amplification primer under conditions to generate double 
stranded amplification products corresponding substantially to the first strand cDNA 
synthesis, such that the cDNA amplification products comprise a double stranded 

20 RNA polymerase promoter; and 



(f) incubating said cDNA amplification products comprising said double 
stranded RNA polymerase promoter under conditions that permit in vitro 
transcription to generate amplified RNA. 

25 

19. A method according to claim 18 wherein said cDNA-RNA hybrid is 
incubated with a reverse transcriptase that adds at least one deoxycytidine residue to 
the 3 5 end of the first strand cDNA synthesis. 



30 20. A method according to claim 18 or claim 19 wherein at least three non- 
templated nucleotide at the 3 9 end of the first strand cDNA synthesis are hybridised 
to a template switching oligonucleotide. 



V 
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21. A method according to any one of claims 18 to 20 wherein the at least three 
of the non-templated nucleotides at the 3' end of the first strand cDNA synthesis are 
deoxycytidine residues. 

5 22. A method according to any one of claims 18 to 21 wherein the reverse 
transcriptase lacks RNaseH activity but retains wild-type polymerase activity. 

23. A method according to any one of claims 18 to 22 wherein the reverse 
transcriptase is a Moloney Murine Leukemia virus (MMLV) reverse transcriptase or 

1 0 a mutant thereof. 

24. A method according to any one of claims 18 to 23 wherein the reverse 
transcriptase is PowerScript™ Reverse Transcriptase (BD Biosciences Clontech). 

15 25. A method according to any one of claims 18 to 24 wherein said template 
switching oligonucleotide comprises at least three ribonucleotide residues. 

26. A method according to any one of claims 18 to 25 wherein said template 
switching oligonucleotide comprises at least three guanine residues. 

20 

27. A method according to any one of claims 18 to 26 wherein said amplification 
primer has the same sequence as the amplifier sequence of said cDNA synthesis 
oligonucleotide. 

25 28. A method according to any one of claims 18 to 27 wherein the reaction 
comprises 1 mM dNTPs. 

29. A method according to any one of claims 18 to 28 wherein the double 
stranded amplification products are obtained by PCR. 

30 

30. A method according to any one of claims 18 to 29 wherein the cDNA 
synthesis oligonucleotide and the PCR primer have the same concentration 
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31. A method according to any one claims 18 to 30 wherein PCR amplification 
is performed using Advantage® 2 Polymerase mix (BD Biosciences Clontech). 

32. A method according to any one of claims 18 to 31 wherein the optimum 
5 number of cycles to generate the double stranded amplification products is 

determined by a method comprising the steps of: 

(a) providing a plurality of samples with a known amount of RNA; 

10 (b) performing amplification for a defined number of cycles on the plurality of 
samples; 

(c) purifying the double stranded amplification products; 

15 (d) providing for the in vitro transcription of the purified amplification products; 
and 

(e) determining the number of amplification cycles that results in the minimu m 
amount of amplified RNA that is required. 

20 

33. A method according to any one of claims 18 to 32 wherein the RNA sample 
is a clinical sample selected from the group consisting of a biopsy, a microdissected 
tissue, a fine needle aspirate, a flow-sorted cell, a laser captured microdissected cell 
or a single cell. 

25 

34. A method for preparing an expression library of a cell or a cell population 
comprising the steps of: 

(a) providing a cDNA synthesis oligonucleotide comprising an amplifier 
30 sequence and an RNA annealing region operably linked to an RNA polymerase 
promoter 
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(b) contacting said cDNA synthesis oligonucleotide with a population of 
mRNAs from said cell or cell population under conditions to allow hybridisation of 
said cDNA synthesis oligonucleotide to mRNA to produce a cDNA-mRNA 
complex; 

5 

(c) incubating said cDNA-mRNA complex under conditions which permit 
template-dependent extension of said cDNA synthesis oligonucleotide to generate a 
cDNA-mRNA hybrid; 

10 (d) contacting said cDNA-mRNA hybrid with a template switching 
oligonucleotide under conditions which permit template dependent extension of said 
cDNA, such that the 3' end of the cDNA of the cDNA-mRNA hybrid comprises a 
sequence complementary to said template switching oligonucleotide; 

15 (e) contacting an amplification primer with said cDNA-mRNA hybrid under 
conditions that generate double stranded amplification products corresponding to the 
first strand cDNA synthesis, such that the double stranded cDNA amplification 
products comprise a double stranded RNA polymerase promoter; and 

20 (f) incubating said double stranded cDNA amplification products comprising 
said double stranded RNA polymerase promoter under conditions that permit in 
vitro transcription to generate amplified RNA. 

35. A method of preparing a cDNA library from a collection of mRNA 
25 molecules comprising the steps of: 

(a) providing a cDNA synthesis oligonucleotide comprising an amplifier 
sequence and an RNA annealing region operably linked to an RNA polymerase 
promoter; 

30 

(b) contacting said cDNA synthesis oligonucleotide with the collection of 
mRNAs under conditions to allow annealing of said cDNA synthesis 
.oligonucleotide to mRNA produce a cDNA-mRNA complex; 
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(c) incubating said cDNA-mRNA complex under conditions which permit 
template-dependent extension of said cDNA synthesis oligonucleotide to generate a 
cDNA-mRNA hybrid; 

5 

(d) contacting said cDNA-mRNA hybrid with a template switching 
oligonucleotide under conditions which permit template dependent extension of said 
cDNA of said hybrid, such that the 3' end of the cDNA of the cDNA-mRNA hybrid 
comprises a sequence complementary to said template switching oligonucleotide; 

10 

(e) contacting a PCR primer with said cDNA-mRNA hybrid under conditions 
that generate double stranded amplification products corresponding to the first strand 
cDNA synthesis, such that the double stranded cDNA amplification products 
comprise a double stranded RNA polymerase promoter; 

15 

(f) incubating said double stranded cDNA amplification products comprising 
said double stranded RNA polymerase promoter under conditions that permit in 
vitro transcription to generate amplified RNA; and 

20 (g) preparing a cDN A library from the amplified RNA. 

36. A method for performing subtractive hybridisation comprising the steps of: 

(a) providing a cDNA synthesis oligonucleotide comprising an amplifier 
25 sequence and an RNA annealing region operably linked to an RNA polymerase 

promoter; 

(b) contacting the cDNA synthesis oligonucleotide with a collection of mRNAs 
under conditions to allow annealing of said cDNA synthesis oligonucleotide to 

30 mRNA in said RNA sample to produce a cDNA-mRNA complex; 
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(c) incubating said cDNA-mRNA hybrid with enzyme, dNTPs and buffer under 
conditions which permit template-dependent extension of said cDNA synthesis 
oligonucleotide to generate a cDNA-mRNA hybrid; 

5 (d) contacting said cDNA-mRNA hybrid with a template switching 
oligonucleotide under conditions which permit template dependent extension of said 
cDNA of said hybrid, such that the 3' end of the cDNA of the cDNA-mRNA hybrid 
comprises a sequence complementary to said template switching oligonucleotide; 

10 (e) contacting an amplification primer with said cDNA-mRNA hybrid under 
conditions to generate double stranded amplification products corresponding to the 
first stand cDNA synthesis, such that the double stranded cDNA amplification 
products comprise a double stranded RNA polymerase promoter; 

15 (f) incubating said double stranded cDNA amplification products comprising 
said double stranded RNA polymerase promoter under conditions that permit in 
vitro transcription to generate amplified RNA; 

(g) contacting said amplified RNA with a single stranded nucleic acid 
20 population in the opposite sense to said amplified RNA; 

(h) providing for the hybridisation of the sequences present in the amplified 
RNA and the single stranded nucleic acid population; and 

25 (i) isolating the nucleic acid population that remains single stranded. 

37. A method for detecting the expression of a gene of interest comprising the 
steps of: 

30 (a) providing a cDNA synthesis oligonucleotide comprising an amplifier 
sequence and an RNA annealing region operably linked to an RNA polymerase 
promoter, wherein the RNA annealing region comprises a sequence that is 
substantially homologous to the mRNA expressed by the gene of interest; 
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(b) contacting said cDNA synthesis oligonucleotide with a population of 
mRNAs in a cell or cell population under conditions to allow annealing of said 
cDNA synthesis oligonucleotide to mRNA to produce a cDNA-mRNA complex; 

5 

(c) incubating said cDNA-mRNA hybrid under conditions which permit 
template-dependent extension of said cDNA synthesis oligonucleotide to generate a 
cDNA-mRNA hybrid; 

10 (d) contacting said cDNA-mRNA hybrid with a template switching 
oligonucleotide under conditions which permit template dependent extension of said 
cDNA of said hybrid, such that the 3' end of the cDNA of the cDNA-mRNA hybrid 
comprises a sequence complementary to said template switching oligonucleotide; 

15 (e) contacting an amplification primer with said cDNA-mRNA hybrid under 
conditions to generate double stranded amplification products corresponding to the 
first stand cDNA synthesis, such that the double stranded cDNA amplification 
products comprise a double stranded RNA polymerase promoter; 

20 (f) incubating said double stranded cDNA amplification products comprising 
said double stranded RNA polymerase promoter under conditions that permit in 
vitro transcription to generate amplified RNA; and 

(g) determining the presence or absence of amplified RNA, which amplified 
25 RNA is complementary to mRNA corresponding to the gene of interest. 

38. Amplified RNA obtainable by the method according to any one of claims 1 8 
to 33. 

30 39. An expression library obtainable by the method according to claim 34. 

40. A cDNA library obtainable by the method according to claim 35. 
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4L Use of a cDNA-mRNA hybrid according to any one of claims 1-17 in the 
amplification of RN A. 

42. Use of a cDNA-mRNA hybrid according to any one of claims 1-17 in the 
5 preparation of a cDNA library. 

43. Use of a cDNA-mKNA hybrid according to any one of claims 1-17 in 
subtractive hybridisation. 

10 44. Use of a cDNA-mRNA hybrid according to any one of claims 1-17 for 
measuring gene expression. 

45. The use according to claim 44 wherein the gene expression is measured 
using a microarray. 

15 

46. A kit for the amplification of RNA in a sample comprising: 

(a) a cDNA synthesis oligonucleotide comprising an amplifier sequence and an 
RNA annealing region operably linked to an RNA polymerase promoter; 

20 

(b) a template switching oligonucleotide that has substantially the same 
sequence as the amplifier sequence; and 

(c) an amplification primer that has substantially the . same sequence as the 
25 template switching oligonucleotide. 

47. The kit according to claim 46, wherein the kit further comprises in a separate 
container a reverse transcriptase. 

30 48. The kit according to claim 47 wherein the reverse transcriptase lacks 
RNaseH activity but retains wild-type polymerase activity. 
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49. The kit according to claim 47 or claim 48 wherein the reverse transcriptase is 
a Moloney Murine Leukemia virus (MMLV) reverse transcriptase or a mutant 
thereof. 

5 50. The kit according to any one of claims claim 47 to 49 wherein the reverse 
transcriptase is PowerScript™ Reverse Transcriptase (BD Biosciences Clontech). 

51. The kit according to any one of claims 47 to 50, wherein the kit further 
comprises in a separate container an RNA polymerase specific to the RNA 

10 polymerase promoter of the cDNA synthesis oligonucleotide. 

52. The kit according to any one of claims claim 47 to 51, wherein the RNA 
polymerase promoter is selected from a T7, T3 or SP6 RNA polymerase promoter. 

15 53. The kit according to any one of claims 47 to 52, wherein the kit further 
comprises an amplification buffer and one or more amplification enzymes. 

54. The kit according to claim 53, wherein the amplification buffer and the 
amplification enzyme(s) are PCR amplification buffer and PGR amplification 
20 enzyme(s). 



55. The kit according to any one of claims 47 to 54, wherein the kit further comprises a 
control nucleic acid. 
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